An Introduction to 

MODERN STATISTICAL 
METHODS 


BY 

PAUL R. RIDER 


Washington University 
Saint Louis 


NEW YORK

JOHN WILEY & SONS, Inc. 

London: CHAPMAN & HALL, Limited 



PREFACE 


In the past three decades enormous advances have been made 
in the field of statistics. These advances have taken place so 
rapidly that it is not at all surprising that those employing 
statistical methods have found it difficult to keep pace with 
them, or that certain of the older methods, which are obsolete, 
and even in some cases erroneous — or, at the very best, crudely 
approximative in character — continue to be taught in the class- 
room and to be treated in textbooks which appear from time 
to time. A notable example is the use of the probable error 
or standard error in testing the significance of a correlation co- 
efficient derived from a sample, although this method gives un- 
reliable or incorrect results if there is a high degree of correla- 
tion in the population from which the sample is drawn, or if 
the number in the sample is small. 

It is, of course, in the theory of small samples that the 
greatest progress has been made. The theory of sampling which 
assumes that the sample is composed of a large number of items 
is inadequate for many practical purposes. Biological, agri- 
cultural, and other scientific experiments frequently deal with 
comparatively few observations. Sometimes the cost of obtain- 
ing additional observations is prohibitive; sometimes, indeed, it 
is impossible to obtain more data, as might be true in the case
of meteorological records. In manufacturing inspection, too, 
small samples are of frequent occurrence. 

Most of this theory has been developed and unified by R. A. 
Fisher, who has shown how to make more accurate estimates 
and how to utilize the maximum amount of information contained
in a set of data, and has provided exact tests of reliability
and significance. Fisher's efficient methods, at first slow
in taking hold, because not thoroughly understood, gradually
began to gain momentum, and are now spreading rapidly.

In this book I have endeavored to explain the most widely 
used of these methods, illustrating their application by com- 
Work on this book was started when both of us were members of the
staff of Iowa State College. At that time requests were received rather 
frequently from research workers. Some wanted advice on the conduct 
of a specific experiment: others, who had decided to use one of the more 
complex designs that have been discovered in recent years, asked for a 
plan or layout that could be followed during the experimental opera- 
tions. Although the logical principles governing the subject of experi- 
mentation are admirably expounded in Fisher’s book The Design of 
Experiments , these requests indicated a need for a different type of 
book, one which would describe in some detail the most useful of the 
designs that have been developed, with accompanying plans and an 
account of the experimental situations for which each design is most 
suitable. Such a book is directed at the experimenter and is intended 
to serve as a handbook which is consulted when a new experiment is 
under consideration. 

Mainly on account of the war, slow progress was made. In 1944 
we completed a mimeographed draft of which several hundred copies 
were distributed. Many helpful suggestions were made by readers. 
Of these, the most frequent was to the effect that we should include more 
material dealing with the statistical analysis of the results. In the 
mimeographed draft, our practice was to give references to worked 
examples of the analysis in cases where they could be found in the 
literature, and to present examples for only those designs whose analysis 
was not available. To this it was objected that many research workers 
did not have easy access to our references. 

This suggestion raised a difficult issue. To present a self-contained 
account of the analysis of variance in all its ramifications would make 
the book, it seemed to us, unwarrantably long and expensive. Conse- 
quently we have continued to assume that the reader has some knowl- 
edge of the principles of analysis of variance and of the computational 
methods involved. We have included a brief review of the basic theory 
and an extensive set of worked examples of the analysis for both com- 
mon and less common types of design. Although strenuous efforts 
were made to obtain a selection of examples from diverse fields of 
research, a preponderance from biology, and more particularly from 
agriculture, was dictated by our own experience in those areas. On
several occasions we rejected an example which would have made an 
attractive addition to the scope because we did not feel sufficiently 
familiar with the conditions of experimentation to give a realistic account 
of the problems encountered. 

Since courses of lectures on the design of experiments are being intro- 
duced in many colleges and universities, some teachers may be inter- 
ested in the potentialities of the book as a textbook. Several comments 
are prompted by our own experience in this connection. First, it will 
often be necessary for the teacher to provide a more systematic develop- 
ment of the analysis of variance than is given here. Second, interest in 
such a course is greatly enhanced by examples from an environment 
with which the listeners are familiar, and especially by examples of
experiments that have been conducted by some of the listeners. Thus 
the teacher is well advised to build the course around his own examples, 
using those in this book mainly as supplementary material for the 
students. Finally, a selective use of the book is in order, because it 
contains much more material than can be covered in a typical one- 
quarter course, and because some difficult topics have been dealt with 
in the early parts of the book. 

We wish to express our gratitude to the many staff members at Iowa 
State College and North Carolina State College who helped us by putting 
experimental data at our disposal, by providing painstaking descrip- 
tions of their experimental techniques, or by lending their assistance 
in the preparation of the manuscript and plans. For similar kindnesses 
in connection with certain of the examples we are indebted to 
G. F. Potter, D. Y. Solandt, D. B. DeLury and J. Hunter, F. M. Wadley
and C. F. Rainwater, and the Wailuku Sugar Company, Honolulu.
Some theoretical results used in Chapter 14 were developed from research 
conducted at Raleigh under a contract with the Office of Naval Research.
Finally, our thanks go to George W. Snedecor, who participated in the
original plans for this book and made a careful reading of the first draft. 

W. G. Cochran 
G. M. Cox


January, 1950
CHAPTER 1


INTRODUCTION 


1.1 The Contribution of Statistics to Experimentation 

1.11 The Problem of Interpretation. Since statisticians do not usually 
perform experiments, their claim to attention when they write on this 
subject requires some explanation. It is true that on many important 
aspects of experimentation the statistician has no expert knowledge. 
Nevertheless, in recent years, research workers have turned increasingly 
to statisticians for help both in planning their experiments and in draw- 
ing conclusions from the results. That this has happened is convincing 
evidence that statistics has something to contribute. 

At first, requests for assistance were nearly always concerned with 
the interpretation of the results. It is a common characteristic of
experiments in widely diverse fields of research that, when they are
repeated, the effects of the experimental treatments vary from trial to
trial. This variation introduces a degree of uncertainty into any con- 
clusions that are drawn from the results. Even after a number of repeti- 
tions, or replications as they are called, the investigator still does not 
know by how much his results would be changed if the experiment were 
repeated further under the same conditions. Successive trials may be so 
discrepant in their results that it is doubtful which of two treatments 
would turn out better in the long run. 

As an illustration of this variation, data are given in table 1.1 from a 
simple experiment. The data are the times (minus 2 minutes) required 
to perform a routine statistical calculation, that of finding the sum of 
squares of 27 observations. Each sum of squares was computed sep- 
arately by the same person on both of two standard computing machines. 
In all, 10 different sums of squares were worked, making 10 trials or
replications of the experiment. It will be noted that the differences in
speed range from 17 seconds in favor of machine B to 2 seconds in favor 
of machine A. Some experimenters may comment that the results of 
this experiment are remarkably well behaved, and exhibit nothing like 
the variation with which they have to contend. The results will, how- 
ever, serve our purpose. 
TABLE 1.1 Time in seconds (minus 2 minutes) required for computing
sums of squares

                     Machine
Replication        A        B      Difference (A - B)

     1            30       14             16
     2            21       21              0
     3            22        5             17
     4            22       13              9
     5            18       13              5
     6            29       17             12
     7            16        7              9
     8            12       14             -2
     9            23        8             15
    10            23       24             -1

   Means         21.6     13.6            8.0

The object of the experiment is, of course, to compare the speeds of 
the two machines for this calculation. More specifically, two objects 
might be stated. The first is to answer the question: is there any dif- 
ference in speed? or, to put it another way, to test the hypothesis that 
there is no difference in speed. The second object, which is related to the 
first, is to estimate the size of the difference in speed. Almost all ex- 
periments are carried out for one or both of these purposes — the testing 
of hypotheses and the estimation of differences in the effects of different 
treatments. 

As regards the test of the hypothesis that there is no difference in 
speed, we might report, as relevant evidence, that B proved faster 7 
times out of 10, A twice, while once there was a tie. For the problem of 
estimation we might report that the average difference in speed in the 
experiment was 8 seconds in favor of B. These purely descriptive
statements do not carry us very far. Their weakness is that they supply no
information about the reliability of the figures presented. For example, 
have we any confidence that, if the experiment were continued for an- 
other set of 10 trials, the advantage at the end would still be close to 8 
seconds in favor of B?
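
The descriptive figures quoted above can be checked directly against table 1.1. The short Python sketch below is an illustration only; the variable names are assumptions introduced here and do not appear in the original text.

```python
# Times in seconds (minus 2 minutes) from table 1.1, replications 1 to 10.
machine_a = [30, 21, 22, 22, 18, 29, 16, 12, 23, 23]
machine_b = [14, 21, 5, 13, 13, 17, 7, 14, 8, 24]

# Difference (A - B) in each replication; a positive value means B was faster.
differences = [a - b for a, b in zip(machine_a, machine_b)]

b_faster = sum(d > 0 for d in differences)   # replications won by B
a_faster = sum(d < 0 for d in differences)   # replications won by A
ties = sum(d == 0 for d in differences)
mean_difference = sum(differences) / len(differences)

print(differences)
print(b_faster, a_faster, ties, mean_difference)
```

The counts and the mean agree with the statements in the text, but, as the text goes on to say, they carry no information about reliability.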

Because of the deficiencies in the descriptive approach, experimenters 
have adopted a different point of view in the summarization of their re- 
sults. They tend to reason as follows. Suppose that it were feasible to 
continue the experiment indefinitely under the same conditions. The 
average difference in speed between the two machines would presumably 
settle down to some fixed value. This value, which will be independent 
of the size of the experiment that was actually carried out, may
reasonably be called the true difference between A and B. From this point of
view, the problem of summarizing the results may be restated in the 
question: what can we say about the true difference between A and B? 
This is a problem in induction from the part to the whole, or in sta- 
tistical language, from the sample to the population. A solution to this 
problem has been developed by means of the theory of statistics. It is 
this solution that constitutes the principal contribution of statistics to 
the interpretation of the results. 

1.12 Statistical Inference. Obviously, it cannot be expected that the 
solution will provide the exact value of the unknown true difference. As 
a less ambitious goal we might hope to be able to find 2 limits within 
which the exact value is certain to lie, but even this cannot quite be at- 
tained. What can be done is that for any chosen probability, say .95, 
two limits are found such that the probability that they enclose the true 
difference is .95. In other words, limits can be found that are almost 
certain to enclose the true difference, where the degree of certainty, as 
measured by the probability, can be chosen by the experimenter. Since 
we wish to focus attention on the type of inferential statement that can 
be made rather than on the method of calculating the limits, the com- 
putations will not be discussed at present. For the example in table 1.1 
they will be found in section 4.42. When the probability is .95, the limits 
for the true difference in speed between the 2 machines turn out to be 
3.3 and 12.7 seconds in favor of machine B. A statement that B is 
faster by an amount that lies between 3.3 and 12.7 seconds has a 1 in 
20 chance of being wrong. If the degree of certainty is decreased by 
lowering the probability to .8, the limits are narrowed to 5.1 and 10.9 
seconds. If the probability is raised to .99, the limits become 1.1 and 
14.9 seconds, and as the probability is brought closer to certainty, the 
limits steadily become farther apart. The limits are called confidence 
limits, and the probabilities are called confidence probabilities.* 
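
To make the idea of confidence limits concrete, the following Python sketch computes limits for the mean of the 10 differences in table 1.1 by a simple paired analysis based on Student's t. This is an assumption made for illustration: the limits quoted in the text come from the calculations of section 4.42, which take account of the way the experiment was actually conducted, so the figures produced here (roughly 2.9 and 13.1 seconds at probability .95) are close to, but not the same as, 3.3 and 12.7. The critical values are copied from standard t tables for 9 degrees of freedom.

```python
import math
import statistics

differences = [16, 0, 17, 9, 5, 12, 9, -2, 15, -1]  # A - B, table 1.1

# Two-sided critical values of Student's t with 9 degrees of freedom,
# taken from standard tables (assumed here rather than computed).
T_CRITICAL_9_DF = {0.80: 1.383, 0.95: 2.262, 0.99: 3.250}

def paired_limits(diffs, probability):
    """Confidence limits for the true mean difference (simple paired analysis).

    Valid only for 10 differences, since the table above is for 9 d.f.
    """
    n = len(diffs)
    mean = statistics.mean(diffs)
    standard_error = statistics.stdev(diffs) / math.sqrt(n)
    half_width = T_CRITICAL_9_DF[probability] * standard_error
    return mean - half_width, mean + half_width

for p in (0.80, 0.95, 0.99):
    low, high = paired_limits(differences, p)
    print(f"{p:.2f}: {low:.1f} to {high:.1f} seconds in favor of B")
```

Whatever method is used, the qualitative behavior described in the text holds: the limits widen as the confidence probability is raised.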

These probabilities are not merely academic abstractions: they can 
be subjected to at least a rough experimental verification. For verifi- 
cation, we need a situation where the true difference between the effects 
of two treatments is known. In toxicology, for instance, this situation 
can sometimes be obtained by diluting a standard poison to a known 
extent. The dilution is sent to the laboratory, labelled as an “unknown” 
poison. By experiments on animals, confidence limits are found for the 

* Fisher (1.2) has developed statistical inference in terms of fiducial limits. The 
two concepts, fiducial and confidence limits, have different logical backgrounds, al- 
though in all simple applications to controlled experiments the actual values of the 
fiducial and confidence limits are identical. For a discussion, see Kendall (1.1).
amount of the unknown that has the same toxicity as a given amount
of the standard. Since the persons originating the experiment know the 
true value for this amount, they can verify whether the statement that 
the true amount lies within the limits is correct. The practical difficulty 
with this type of check is the labor required. A large number of experi- 
ments would be needed to verify whether about 95% of the statements 
made with confidence probability .95 were in fact correct, and about 
5% were wrong. 
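
The labor of such a check can be avoided on a computer, where the true value is known because we choose it. The sketch below is a simulation under stated assumptions, not a description of any real toxicological experiment: the observations are drawn from a normal distribution, the true difference (8) and standard deviation (7) are hypothetical, and each simulated experiment has 10 observations so that the tabulated t value for 9 degrees of freedom applies.

```python
import math
import random
import statistics

T_95_9_DF = 2.262        # two-sided 95% point of Student's t, 9 d.f. (from tables)
TRUE_DIFFERENCE = 8.0    # hypothetical "known" true value
SIGMA = 7.0              # hypothetical standard deviation of one observation

def covers(rng, n=10):
    """Simulate one experiment; report whether its 95% limits enclose the truth."""
    sample = [rng.gauss(TRUE_DIFFERENCE, SIGMA) for _ in range(n)]
    mean = statistics.mean(sample)
    half_width = T_95_9_DF * statistics.stdev(sample) / math.sqrt(n)
    return mean - half_width <= TRUE_DIFFERENCE <= mean + half_width

rng = random.Random(1)   # fixed seed so that the run is repeatable
trials = 4000
coverage = sum(covers(rng) for _ in range(trials)) / trials
print(f"fraction of correct statements: {coverage:.3f}")
```

With several thousand simulated experiments the fraction of correct statements should fall close to .95, which is the rough verification the text describes.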

As we have seen, the statistical solution to the problem of
estimation consists of a statement that the true difference lies between certain
limits, plus a probability that the statement is correct. It is of interest 
to consider whether this type of information is sufficiently precise to 
permit decisions of practical importance to be made. Although a 
thorough discussion of this question would be rather lengthy, inferences 
of this type often permit definite action to be taken with confidence 
that the action will be fruitful. When they fail to decide the point at 
issue, the reason is nearly always that the data obtained are insufficient. 
For illustration, suppose that it is desired to discover whether the ap- 
plication of a dressing of some fertilizer to a crop will be profitable. The 
cost of the fertilizer is such that its application will be profitable only if 
it increases yields on the average by 2 or more bushels. A series of ex- 
periments is carried out in order to estimate the true average responses 
to the fertilizer. If the 95% confidence limits for the increase due to the 
fertilizer are 4 and 11 bushels, its use can be adopted with a good deal 
of assurance that it will be profitable. Similarly, if the 95% confidence 
limits are -5 and 1 bushels, a decision not to use the fertilizer follows.
A case where there is uncertainty occurs when the limits are 0 and 5 
bushels. Here it is likely that there will be either a small gain or a small 
loss, but no recommendation can be made without considerable risk of 
its being wrong. If it is important to make the correct decision, further 
experiments must be conducted in order to narrow the distance between 
the confidence limits. 
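
The reasoning in the fertilizer illustration amounts to comparing both confidence limits with the break-even increase of 2 bushels. A minimal Python sketch of that rule follows; the function name and the wording of the three outcomes are assumptions made for illustration.

```python
def fertilizer_decision(lower, upper, break_even=2.0):
    """Classify 95% confidence limits (in bushels) against the break-even increase."""
    if lower >= break_even:
        # Even the lower limit pays for the fertilizer.
        return "use the fertilizer"
    if upper < break_even:
        # Even the upper limit fails to pay for the fertilizer.
        return "do not use the fertilizer"
    # The limits straddle the break-even point.
    return "undecided: further experiments needed"

for limits in [(4, 11), (-5, 1), (0, 5)]:
    print(limits, "->", fertilizer_decision(*limits))
```

The three pairs of limits are those discussed in the text, and the rule returns the same three conclusions.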

Thus far we have considered the problem of estimating the true dif- 
ference between the effects of two treatments. In testing hypotheses, 
we are interested in the supposition that the true difference has some 
specified value, most commonly zero. As in the case of estimation, diffi- 
culty arises because of the variability that is typical of experimental 
data. As a result of this variability, the data are never exactly in agree- 
ment with the hypothesis, and the problem is to decide whether the 
discrepancy between the data and the hypothesis is to be ascribed to 
these variations or to the fact that the hypothesis is not true. The con- 
tribution of statistics is the operation known as a test of significance. 
Essentially, this is a rule for deciding, from examination of the data,
whether or not to reject the hypothesis. Such rules are made to satisfy 
two conditions that are obviously desirable: (i) hypotheses that are true 
shall be rejected only very occasionally, and the probability of rejection 
may be chosen by the experimenter; (ii) hypotheses that are false shall 
be rejected as often as possible. 

This technique enables the experimenter to test his hypotheses about 
the action of the treatments, with the assurance that there is little risk 
of erroneously rejecting a hypothesis that happens to be correct. Prob- 
abilities of .05 and .01 are most commonly used for this risk, and in 
these cases the tests are said to be made at the 5 and 1% significance 
levels respectively. These levels are just useful conventions, and a lower 
probability may be used if the consequences of an erroneous rejection 
of the hypothesis are very serious. It should be remembered, however, 
that in lowering this probability value we automatically diminish the 
chance of rejecting a hypothesis that is false. 
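
The trade-off between the two risks can be shown numerically. The sketch below assumes, purely for illustration, a one-sided test based on the normal distribution with a known standard error; the text does not specify any particular test, and the true difference of 2.5 standard errors is a hypothetical value.

```python
from statistics import NormalDist

STANDARD_NORMAL = NormalDist()

def chance_of_rejection(level, true_difference_in_se):
    """Chance that a one-sided normal test at `level` rejects "no difference".

    `true_difference_in_se` is the true difference expressed in units of
    the standard error of the estimate (an assumed, known quantity).
    """
    critical = STANDARD_NORMAL.inv_cdf(1 - level)
    return 1 - STANDARD_NORMAL.cdf(critical - true_difference_in_se)

for level in (0.05, 0.01):
    print(level,
          round(chance_of_rejection(level, 0.0), 3),   # hypothesis true
          round(chance_of_rejection(level, 2.5), 3))   # hypothesis false
```

When the hypothesis is true the chance of rejection equals the chosen level; when it is false, the chance of rejection is smaller at the 1% level than at the 5% level.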

A useful property of a test of significance is that it exerts a sobering 
influence on the type of experimenter who jumps to conclusions on 
scanty data, and who might otherwise try to make everyone excited 
about some sensational treatment effect that can well be ascribed to the 
ordinary variation in his experiment. On the whole, however, tests of 
significance are less frequently useful in experimental work than con- 
fidence limits. In many experiments it seems obvious that the dif- 
ferent treatments must have produced some difference, however small, 
in effect. Thus the hypothesis that there is no difference is unrealistic;
the real problem is to obtain estimates of the sizes of the differences. 

The construction of confidence limits may add something to the in- 
terpretation of a test of significance. For instance, suppose that the 
speeds of the 2 machines in the example had not been significantly dif- 
ferent. It is a commonplace that this result would not prove that the 
2 machines were identical in speed. However, if the 95% confidence 
limits for the difference in speed were -2 and +4 seconds, we might
argue that a true difference of 4 seconds, even if it existed, would be of
no practical significance. Consequently, it could be said that for all 
practical purposes the 2 machines are identical in speed. This is much 
more positive and useful than the mere statement that the difference in 
speeds was not statistically significant. If, on the other hand, the
confidence limits were -30 and +32 seconds, there is no justification for the
conclusion that the machines can be regarded as equivalent. All that 
we have learned is that the data are not sufficiently accurate to show 
whether there is a difference in speed that is of practical importance. 

To summarize: variability in results is typical in many branches of
experimentation. Because of this, the problem of drawing conclusions
from the results is a problem in induction from the sample to the popula- 
tion. The statistical theories of estimation and of testing hypotheses 
provide solutions to this problem in the form of definite statements that 
have a known and controllable probability of being correct. These 
statements are specific enough to be useful in deciding whether action 
can be taken on the basis of the results. 

1.13 The Function of Randomization. As would be expected, the type 
of statistical inference that can be made from a body of data depends on 
the nature of the data. It is easy to conduct an experiment in such a 
way that no useful inferences can be made, and many of the experiments 
brought to the statistician, particularly in earlier years, have been of this 
type. To take a simple example, suppose that in the comparison of the 
calculating machines each sum of squares had been computed first on 
machine A and then on machine B. Now it is quite possible that in- 
creased familiarity with the data will enable the second computation to 
be done faster than the first. The advantage is unlikely to be great in 
such an easy calculation; it could be so if the computation were more 
difficult. (In the experiment this advantage was estimated at about -1 
seconds, though the confidence limits for the true advantage are rather 
far apart.) 

If the experiment is conducted in this way, the observed difference in 
speed (B - A) is an estimate of the true difference, plus the unknown
difference in speed between a second calculation and a first. Confidence 
limits set up by statistical techniques apply not to the true difference 
between B and A but to the true difference plus this unknown advantage. 
Consequently, the limits tell us nothing definite about the true difference 
between B and A. The interpretation of tests of significance also be- 
comes confused. If A is found significantly faster than B by ordinary 
statistical tests, we can be confident that there is a real difference in 
favor of A, since A was handicapped in the course of the experiment. 
But if B is found significantly faster than A, we do not know what to 
conclude. In this case we are dealing with a bias whose nature can be 
anticipated before the experiment has started. In other experiments 
where less is known about the type of variability that is present, similar 
biases that are quite unexpected can occur from some apparently in- 
nocuous rule about the way in which different treatments are handled. 

In order to avoid these biases we need some means of insuring that a
treatment will not be continually favored or handicapped in successive
replications by some extraneous source of variation, known or unknown.
This is done by the device known as randomization, due to Fisher.
Instead of performing every calculation first on machine A, we apply the
principle of randomization by tossing a coin to determine whether A or
B shall be used first in any trial. The decision is made independently
in each trial. The effect is that in any trial each machine has an equal
chance of being tested under the more favorable conditions. Of course,
the result of any specific randomization may favor one or the other 
treatment. But this happens only to an extent that is allowed for in the 
calculations that are used for tests of significance and confidence limits. 
This important result has been illustrated in detail by Fisher (1.2), who 
has shown how tests of significance and confidence limits can be con- 
structed, using only the fact that randomization has been properly ap- 
plied in the experiment. Randomization is one of the few character- 
istics of modern experimental design that appears to be really modern. 
One can find experiments made 100 or 150 years ago that embody the 
principles that are now regarded as sound, with the conspicuous ex- 
ception of randomization. 
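
A test of significance that uses only the fact of randomization can be sketched for the coin-tossing scheme described above. The Python example below is an illustration in the spirit of such tests, not a reproduction of Fisher's own calculations, and it assumes the simple coin-toss randomization rather than any other design. If the machines do not differ, the coin alone decides the sign of each difference in table 1.1, so every one of the 2^10 patterns of signs is equally likely.

```python
from itertools import product

differences = [16, 0, 17, 9, 5, 12, 9, -2, 15, -1]  # A - B, table 1.1
observed = abs(sum(differences))                     # total observed: 80

# Enumerate every pattern of signs the coin tosses could have produced
# and count those giving a total at least as extreme as the one observed.
as_extreme = 0
total = 0
for signs in product((1, -1), repeat=len(differences)):
    total += 1
    if abs(sum(s * d for s, d in zip(signs, differences))) >= observed:
        as_extreme += 1

p_value = as_extreme / total
print(as_extreme, total, p_value)
```

Only 16 of the 1024 patterns are as extreme as the observed one, a proportion of about .016, so under these assumptions the hypothesis of no difference would be rejected at the 5% level.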

The occasions on which randomization is required vary with the type 
of experiment and must be left to the judgment of the experimenter. 
One occasion arises when the treatments are allotted to the experimental 
material. Suppose that the effects of different diets on the heights and
weights of children are to be ascertained. Since different children grow 
at different rates, a treatment that happens to be assigned to a group of 
fast-growing youngsters is favored. Consequently, we allot the diets in 
any replication at random to the children who are to receive them, with 
a new random allotment in each replication. Similarly, if 4 different 
oven temperatures for the cooking of roasts are under comparison, the 
4 temperatures are assigned at random to the 4 roasts which form the 
material for any replication. Sometimes this is the only randomization 
required, but frequently other operations that are carried out in the 
course of the experiment are also potential sources of bias. With a 
repetitious operation the order of events may be important, either be- 
cause a learning process is involved which tends to make later operations 
better than the earlier ones, or because fatigue tends in the opposite 
direction. Systematic biases may be guarded against by randomizing 
the order in which the operation is performed on the different treatments 
in a replication. In other cases the equipment that is used introduces 
variation. For example, if 4 ovens are available to cook the 4 roasts, 
we would not always use the same oven for the same temperature, in 
case biases should be introduced because of systematic differences among 
the ovens. Instead, the temperatures could be assigned at random to 
the ovens in each replication. Thus we have two randomizations, one
to assign the temperatures to the roasts and one to assign them to the 
ovens. If, however, we decide, before randomizing, which roast is to go
in which oven, the two randomizations can be reduced to one. This 
method can always be used, if convenient, to cut down the number of
randomizations that must be made. 
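
The single randomization described for the roasts and ovens might be carried out as in the Python sketch below. The labels for temperatures, roasts, and ovens, and the choice of 3 replications, are hypothetical; only the procedure (fix the roast-oven pairs first, then allot the temperatures at random, afresh in each replication) is taken from the text.

```python
import random

TEMPERATURES = ["T1", "T2", "T3", "T4"]   # the 4 temperatures under comparison
# Decided before randomizing: which roast goes in which oven.
PAIRS = [("roast 1", "oven 1"), ("roast 2", "oven 2"),
         ("roast 3", "oven 3"), ("roast 4", "oven 4")]

def randomize_replication(rng):
    """Allot the 4 temperatures at random to the 4 fixed roast-oven pairs."""
    order = TEMPERATURES[:]
    rng.shuffle(order)
    return dict(zip(PAIRS, order))

rng = random.Random()   # unseeded: a fresh random allotment on every run
plan = [randomize_replication(rng) for _ in range(3)]   # 3 replications (arbitrary)

for number, allotment in enumerate(plan, start=1):
    print("replication", number, allotment)
```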

Randomization is somewhat analogous to insurance, in that it is a 
precaution against disturbances that may or may not occur and that 
may or may not be serious if they do occur. It is generally advisable to 
take the trouble to randomize even when it is not expected that there 
will be any serious bias from failure to randomize. The experimenter is
thus protected against unusual events that upset his expectations. Of 
course in experiments where a great number of physical operations are 
involved, the application of randomization to every operation becomes 
time consuming, and the experimenter may use his judgment in omitting
randomization where there is real knowledge that the results will not be
vitiated. It should be realized, however, that failure to randomize at
any stage may introduce bias unless either the variation introduced in 
that stage is negligible, or the experiment effectively randomizes itself. 

1.14 Restricted Randomization. We now reconsider the randomiza- 
tion proposed for the experiment on computing machines, in order to 
discuss a criticism that presumably has occurred to the reader. When 
this experiment is being planned, we possess the knowledge that there is 
some advantage to the machine on which the second calculation of any 
sum of squares is made. In the light of this knowledge, it seems wise to 
make sure that each machine receives the advantage in five of the 10 
replications, rather than to leave this to be decided by the tossing of 
coins. In fact, if the decision is made by tossing coins, the chances are
only about 1 in 4 that each machine will be used first in five replications;
they are also about 1 in 4 that one machine will be used first in 7 replica- 
tions. The criticism is that the randomization is unlikely to give as ac- 
curate a comparison as the proposed alternative. The criticism is sound, 
and the experiment was actually planned so that each machine would 
be used first in half the replications. 
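
The two chances quoted above follow from the binomial distribution for 10 tosses of a fair coin. The short Python check below is given for illustration; the variable names are assumptions.

```python
from math import comb

replications = 10
outcomes = 2 ** replications   # equally likely results of 10 coin tosses

# Each machine used first in exactly 5 replications.
p_five_five = comb(replications, 5) / outcomes

# One machine (either A or B) used first in exactly 7 replications.
p_seven_three = 2 * comb(replications, 7) / outcomes

print(p_five_five, p_seven_three)
```

The first probability is 252/1024 and the second 240/1024, each about 1 in 4.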

Randomization was also applied in the design actually used, though it 
is a different type from that originally proposed, being subject to the 
restriction that each machine must appear first exactly five times. The 
process of randomizing therefore consisted in choosing five numbers at 
random from the numbers 1 to 10; these turned out to be 1, 3, 6, 8, and 9.
Accordingly, machine A was used first in replications 1, 3, 6, 8, and 9. 
This is one of the cases where it is difficult to see why failure to random- 
ize could have led to any serious danger of bias. It probably would have 
been quite satisfactory to use machine A first in the first 5 replicates, and 
machine B first in the remainder. Nevertheless, it might happen that
the advantage in the second calculation of a sum of squares would 
diminish in later replications; and, to save the trouble of trying to guess 
whether biases of this type are likely, the randomization was used as a 
precaution. 
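
The two schemes of randomization discussed in this section can be set side by side in a short Python sketch. The function names are assumptions made for illustration; the restricted scheme chooses at random the five replications in which machine A is used first, and the simple scheme tosses a coin afresh in each replication.

```python
import random

def restricted_order(rng, replications=10):
    """Choose at random the half of the replications in which A goes first."""
    a_first = set(rng.sample(range(1, replications + 1), replications // 2))
    return ["A" if r in a_first else "B" for r in range(1, replications + 1)]

def simple_order(rng, replications=10):
    """Toss a coin independently in each replication."""
    return [rng.choice("AB") for _ in range(replications)]

rng = random.Random()   # unseeded, so each run gives a new arrangement
print("restricted:", "".join(restricted_order(rng)))
print("simple:    ", "".join(simple_order(rng)))
```

The draw reported in the text (replications 1, 3, 6, 8, and 9) corresponds to the arrangement ABABBABAAB under the restricted scheme, which always gives each machine first place exactly five times, whereas the simple scheme does so only about one time in four.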

The example represents a fairly common situation, in which knowledge 
of the variation that will affect the results is sufficiently detailed so that 
the simplest type of randomization is objectionable on the grounds of 
accuracy. Most of the designs in this book were constructed for this 
situation. By suitable restrictions on the randomization, these designs 
enable the experimenter to utilize any knowledge that will increase the 
precision of the experiment. In addition, each design allows sufficient 
randomization to prevent biases from sources of variation about which 
knowledge is less certain. For each of the principal designs described, 
the appropriate method of randomization is also presented. 

One further point should be mentioned. We have seen that the ex- 
periment on computing speeds could have been carried out either by 
means of the simple randomization originally proposed or by means of 
the restricted randomization which ensures that each machine is used 
first five times. Both methods provide a valid test of significance and 
valid confidence limits. However, the calculations made for the test of 
significance and for the confidence limits are different in the 2 cases. 
The first experiment is of the type known as randomized blocks, and
calculations are made as described in section 4.23; the second experi- 
ment is of the “cross-over” type, for which the calculations are given in 
section 4.42. This is an instance of the general rule that the way in 
which the experiment is conducted determines not only whether in- 
ferences can be made, but also the calculations required to make them. 
The experimenter must make sure that the calculations which he uses 
are appropriate for the experiment. 

1.2 Initial Steps in the Planning of Experiments

1.21 Importance of the Initial Steps. It has been mentioned that
statisticians are often asked for advice in making inferences from the 
results of experiments. Since the inferences that can be made depend on 
the way in which the experiment was carried out, the statistician should 
request a detailed description of the experiment and its objectives. It may 
then become evident that no inferences can be made or that those which 
can be made do not answer the questions to which the experimenter had 
hoped to find answers. In these unhappy circumstances, about all that 
can be done is to indicate, if possible, how to avoid this outcome in future 
experiments. Consequently, it has come to be realized that the time to 
think about statistical inference, or to seek advice, is when the experiment 
is being planned. 

Participation in the initial stages of experiments in different areas of 
research leads to a strong conviction that too little time and effort is put 
into the planning of experiments. The statistician who expects that his 
contribution to the planning will involve some technical matter in sta- 
tistical theory finds repeatedly that he makes a much more valuable 
contribution simply by getting the investigator to explain clearly why 
he is doing the experiment, to justify the experimental treatments whose 
effects he proposes to compare, and to defend his claim that the completed 
experiment will enable its objectives to be realized. For this 
reason the remainder of this chapter is devoted to some elementary 
comments on the subject of planning. These comments are offered with 
diffidence, because they concern questions on which the statistician has, 
or should have, no special authority, and because some of the advice is 
so trite that it would be unnecessary if it were not so often overlooked. 

It is a good practice to make a written draft of the proposals for any 
experiment. This draft will in general have three parts: (i) a statement 
of the objectives; (ii) a description of the experiment, covering such 
matters as the experimental treatments, the size of the experiment, and 
the experimental material; and (iii) an outline of the method of analysis 
of the results. 


1.22 The Statement of Objectives. This statement may be in the 
form of the questions to be answered, the hypotheses to be tested, or the 
effects to be estimated. The aim should be to make the statement lucid 
and specific. The most common faults are vagueness and excessive 
ambition, in the sense that a twenty-year research program would be 
required in order to realize the stated objectives. Often it is advisable 
to classify the objectives as major and minor, because some types of 
experiment give high precision for certain treatment comparisons but 
low precision for others. When the experiment represents a cooperation 
among people of different interests, this classification is particularly 
helpful in that it makes clear which objectives take priority and helps 
to avoid an unhappy compromise that is adopted in the hope of pleasing 
everybody. 

The statement should include an account of the area over which gen- 
eralizations are to be made, or in other words, of the population about 
which it is hoped to make inferences. If an experiment is to be con- 
ducted on persons suffering from some disease, are the results presumed 
to apply to the patients in a specific hospital, to patients in all hospitals, 
or to all sufferers whether in hospital or not? This sort of question is 
crucial in applied research, where often the experimenter has some 
definite population in mind to which he would like to apply the results. 
It is obvious that worth-while inferences about an extensive population 
cannot be made from a single experiment. For instance, inferences made 
from the experiment on the two computing machines are restricted to 
the person who made the calculations and to the type of calculation 
made. There is no guarantee that the results would be the same for a 
different type of calculation or with different operators. Consequently, 
if the object were to find out which machine is the speedier in general 
use in a computing laboratory, the experiment that was made has only 
scratched the surface of the problem. 

1.23 The Experimental Treatments. We have used the term treatments 
to denote the different procedures whose effects are to be measured 
and compared. In the selection of treatments it is important to define 
clearly each treatment and to understand the role that each treatment 
will play in reaching the objectives of the experiment. Some issues that 
arise in particular cases are as follows. 

1. Confusion is sometimes caused by failure to distinguish whether 
the object is merely to “spot the winner” among the different treatments, 
or whether in addition it is desired to find some clues as to why the 
treatments behave as they do. A good example is the experiment which 
demonstrates, or so we are informed, that each of the three treatments, 
whiskey and water, gin and water, and rum and water, taken orally in 
sufficient quantities, produces some degree of intoxication. By itself the 
experiment provides no information as to whether the intoxication is due 
to the water, to the other ingredient, or to the fact that the two are 
mixed. A more extensive experiment with additional treatments would 
be necessary to throw some light on this question. Although there are 
occasions when it is sufficient simply to discover which is the best of the 
treatments, experience suggests that even in strictly applied research 
progress is faster if experiments supply fundamental knowledge. Sim- 
ilarly, the criticism that a certain treatment should be omitted because 
it would not be used in practice is valid if the aim is to find the best of 
the “practical” treatments, but it is not valid if the “impractical” 
treatment will supply some needed information about the behavior of 
other treatments in the experiment. 

2. The specification of the treatments may raise difficult questions 
about the conditions under which the treatments are to be compared. 
Suppose that the object is to find out the effect of an application of 
sulphate of ammonia on the yield of some crop. It is well known that 
this effect is likely to depend on the amounts of other important plant 
nutrients that are available to the crop. Consequently, it must be 
decided whether to add these nutrients so that they will be in abundant 
supply on all plots, or to test the nitrogen on the soil as it happens to be. 
The decision must be guided by the objectives of the experiment, and 
more particularly by the type of population to which inferences are to 
apply. Sometimes it is advisable to test the nitrogen in both the pres- 
ence and the absence of the other nutrients. One result of this decision 
will be that the experiment becomes also a test of the other nutrients, 
even though this may not have been part of the original plan. Experi- 
ments of this type, where one factor is tested over different levels of 
another factor, are known as factorial experiments and have come to play 
a prominent part in experimentation. They are discussed more fully in 
chapter 5. 

3. In some cases it becomes apparent that the treatments that can 
be tested in practice are not those that we would like to test. The follow- 
ing example is typical of many fields where human relations are involved. 
Suppose that it is desired to compare two methods A and B of teaching a 
foreign language, and that we have defined each method clearly and 
have decided how to measure the success achieved by each method. 
However, some of the teachers who are to participate in the test already 
use a method similar to method A and have strong beliefs that method 
B is inferior, whereas others already follow method B and have no use 
for method A. If the teachers are divided at random into the groups 
that are to teach the two methods, we may expect that some teachers 
will be assigned to teach a method in which they have no faith, and we 
may decide that this is not the kind of comparison that we had in mind. 
On the other hand, if we forsake randomization and allow teachers to 
use the method that they like, differences attributed to the methods may 
in fact be due to differences in the ability of the two groups of teachers. 
Given enough resources, the situation can be more thoroughly explored 
by having each method taught by those who like it, those who do not, 
and those who are neutral, so that there are six treatments instead of 
two. In practice, the resources may not permit this elaboration, and 
the issues that must be faced are how best to use the resources available, 
and whether the experiment, as it can be done, is worth doing. Here 
again, the deciding factor is usually the use to which the results are to be 
put. 

4. Discussion may arise as to the need for a “control,” this term being 
used rather loosely for a treatment in which we are not particularly 
interested, but which may be needed in order to reveal, by comparison, 
whether other treatments are effective. Suppose that we wish to compare 
the effectiveness of three treatments that are qualitatively similar, 
for example, three nitrogenous fertilizers all of which supply the same 
amount of nitrogen. The control would be a “no-nitrogen” treatment. 
Three cases may be distinguished. (i) The effectiveness of this type of 
treatment has been consistently demonstrated previously, and it remains 
to discover which of the three qualities is best. There is no need 
for a control. (ii) The type of treatment is in general effective, but occasionally 
the conditions of the test are such that it is not. For example, 
nitrogenous fertilizers may fail to produce responses in fields 
where the soil fertility is very high. In this case it will be well to add a 
control, which serves primarily to describe the conditions under which 
the fertilizers were tested. (iii) It may not be known whether the type 
of treatment is effective. The control should of course be included, and 
there may be a case for giving it more replication than the other treatments 
receive. An experiment of this type is analyzed in chapter 3, 
where the treatments were 4 types of soil fumigant, and the control (no 
fumigation) was replicated 16 times as against 4 for the other treatments. 
The effect is to obtain a more accurate estimate of the average response 
to fumigants, at the expense of some loss in accuracy in the comparisons 
among the fumigants. 

An interesting example of the illumination that is sometimes produced 
by the inclusion of a control has been reported by Jellinek (1.3). A 
headache remedy contained 3 ingredients, a, b, and c. In order to test 
whether ingredients b and c were necessary, the complete mixture (abc) 
was compared with (ac) and (ab). There were 199 subjects, each of 
whom was treated with each drug for a two-week period, the appropriate 
drug being given whenever the subject complained of a headache. Success 
was measured by the ratio of the number of headaches relieved to 
the number tested in the two-week period. The mean success rates were 
0.84 for (abc), 0.80 for (ac), and 0.80 for (ab); these figures show very 
little overall difference in effectiveness. 

The experiment also contained a “control,” an inert mixture which 
looked the same as the others but had no active ingredients. This was 
tested under the same conditions as the 3 drugs. No less than 120 subjects, 
or about 60%, reported at least some of their headaches relieved 
by the control, a result that is itself of some interest. Further, the 
control enables us to isolate the group of 79 subjects whose headaches 
were not relieved by an inert mixture. For this group, the mean success 
rates were 0.88 for (abc), 0.67 for (ac), and 0.77 for (ab), and could be 
shown to be significantly different. As the author comments: “Banal as 
it may sound, discrimination among remedies for pain can be made only 
by subjects who have a pain on which the analgesic action can be tested.” 

If a control is required, it must be an integral part of the experiment, 
so that results for the control are directly comparable with those for the 
other treatments. This point tends to be overlooked in experiments 
with human subjects when it is difficult or troublesome to assemble the 
desired number of subjects. For example, if a new drug is to be tested 
in some ward of a hospital, the recovery rate in the ward before the drug 
was introduced is not a satisfactory control, nor is the recovery rate in a 
different ward where patients happen to be receiving the standard drug. 
An observed difference between the effects of the new and the standard 
drug might be due to differences in the severity of the disease or in the 
type of patient or in other aspects of the medical care in the two time 
periods or the two wards. It is necessary to regard the new and the 
standard drug as two experimental treatments on an equal footing, and 
to use randomization, or one of the methods of restricted randomization, 
in assigning the two drugs to patients. 
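
The random assignment of the two drugs to patients can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the patient labels, the helper name `assign_treatments`, and the fixed seed are hypothetical and do not come from the text, and a real trial would use whichever restricted randomization its design calls for.

```python
import random

def assign_treatments(patients, treatments=("new", "standard"), seed=None):
    """Randomly assign treatments to patients in (nearly) equal numbers.

    Hypothetical helper: builds a balanced list of treatment labels,
    shuffles it, and pairs it with the patients in their given order.
    """
    rng = random.Random(seed)
    # One label per patient, cycling through the treatments for balance.
    labels = [treatments[i % len(treatments)] for i in range(len(patients))]
    rng.shuffle(labels)  # every ordering of the labels is equally likely
    return dict(zip(patients, labels))

# Illustrative patient identifiers (assumed, not from the text).
patients = [f"patient_{i:02d}" for i in range(1, 11)]
allocation = assign_treatments(patients, seed=1)
for patient, drug in allocation.items():
    print(patient, drug)
```

Because both drugs are allocated by the same chance mechanism within the same ward and period, neither is favored by differences in severity, type of patient, or medical care.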

1.24 Further Details. Other features of the experiment that should 
be included in the draft of the proposals are the number of replications, 
the types of experimental material to be used, and the measurements 
that are to be made. Since these primarily affect the accuracy of the 
experiment, discussion of them is deferred to chapter 2, which is devoted 
to the general question of accuracy. 

Finally, the draft should describe in some detail the proposed method 
for drawing conclusions from the results. This is the section that is 
most frequently omitted, though it is a valuable one. It may include a 
sketch of the analysis of variance, an indication of the tabular form in 
which results will be shown, and some account of the tests of significance 
to be made and the treatment differences that are to be estimated. In 
this process we verify which treatment comparisons are relevant to 
each of the stated objectives of the experiment. Attention is drawn to 
deficiencies in the set of treatments and to treatments that supply no 
information essential to the purpose of the experiment. 

METHODS FOR INCREASING THE ACCURACY OF EXPERIMENTS 


2.1 Introduction 


The results of experiments are affected not only by the action of the 
treatments, but also by extraneous variations which tend to mask the 
effects of the treatments. The term experimental errors is often applied 
to these variations, where the word errors is not synonymous with 
“mistakes,” but includes all types of extraneous variation. Two main 
sources of experimental errors may be distinguished. The first is 
inherent variability in the experimental material to which the treatments 
are applied. We shall use the term experimental unit to denote the group 
of material to which a treatment is applied in a single trial of the experiment. 
The unit may be a plot of land, a patient in a hospital, or a 
lump of dough, or it may be a group of pigs in a pen, or a batch of seed. 
It is characteristic of such units that they produce different results even 
when subjected to the same treatment: these differences, whether large 
or small, contribute to the experimental errors. The second source of 
variability is lack of uniformity in the physical conduct of the experiment, 
or in other words, failure to standardize the experimental technique. 

Neither the presence of experimental errors nor their causes need 
concern the investigator, provided that his results are sufficiently ac- 
curate to permit definite conclusions to be reached. In many fields of 
research, however, with the time and labor that can be given to an ex- 
periment, the results are so greatly influenced by experimental errors 
that only treatment differences that are large can be detected, and even 
these may be subject to considerable uncertainty. As a consequence, 
the investigation of methods for increasing the accuracy of experiments 
has played a prominent part in experimental research in recent years. 

The methods may be classified broadly into three types. The first is 
to increase the size of the experiment, either through the provision of 
more replicates or by the inclusion of additional treatments. The second 
is to refine the experimental technique. Thirdly, we may handle the 
experimental material so that the effects of variability are reduced. 
This may be done by careful selection of the material, by taking additional 
measurements that provide information about the material, or 
finally by skillful grouping of the experimental units in such a way that 
the units to which one treatment is applied are closely comparable with 
those to which another treatment is applied. These various methods are 
discussed in more detail in succeeding sections. 

At this point it is advisable to discuss briefly the concepts accuracy and 
precision. Although these words are regarded as roughly synonymous 
in most dictionaries, they have sometimes been assigned technical mean- 
ings, particularly in the natural sciences and engineering. The difference 
between the concepts may be illustrated by an experiment in which 
children are weighed on a machine that has a bias of 1 lb. That is, if the 
true weight of a child is 46 lb., repeated weighings of the child give read- 
ings which vary around 47 lb. The accuracy of a measurement of the 
weight of the child signifies the closeness with which the measurement 
approaches the true weight, 46 lb. The term precision, on the other 
hand, is concerned merely with the repeatability of the measurements. 
Thus the precision of a measurement denotes the closeness with which 
the measurement approaches the average, 47 lb., of a long series of 
measurements made under similar conditions. It is clear that, if the 
bias is large, a measurement may be of high precision but of low ac- 
curacy.* 
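
A small simulation may make the distinction concrete. The sketch below uses the true weight (46 lb.) and the bias (1 lb.) from the example; the spread of the repeated weighings (0.2 lb.), the normal error model, and the seed are assumptions made purely for illustration.

```python
import random
import statistics

TRUE_WEIGHT = 46.0   # lb, from the example in the text
BIAS = 1.0           # lb, from the example in the text
READING_SD = 0.2     # lb, assumed spread of repeated weighings

def simulate_weighings(n, seed=0):
    """Simulate n readings from a machine with a constant bias."""
    rng = random.Random(seed)
    return [rng.gauss(TRUE_WEIGHT + BIAS, READING_SD) for _ in range(n)]

readings = simulate_weighings(10_000)
mean_reading = statistics.mean(readings)

# Precision: spread of the readings about their own long-run average (47 lb).
precision_sd = statistics.pstdev(readings)
# Accuracy: root-mean-square distance of the readings from the true weight.
accuracy_rmse = (sum((x - TRUE_WEIGHT) ** 2 for x in readings)
                 / len(readings)) ** 0.5

print(f"mean reading    = {mean_reading:.2f} lb")
print(f"precision (SD)  = {precision_sd:.2f} lb")
print(f"accuracy (RMSE) = {accuracy_rmse:.2f} lb")
```

Under these assumptions the readings cluster tightly around 47 lb., so the precision is high, while their typical distance from the true 46 lb. stays near the 1 lb. bias however many weighings are taken.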

Most of the devices mentioned in this chapter, including replication, 
additional measurements, and skillful grouping of the experimental ma- 
terial, operate on the precision of the experiment. In so far as the 
method of measurement remains biased, these procedures do not affect 
this bias. A refinement in technique may reduce it. Other things being 
equal, an increase in precision is accompanied by an increase in ac- 
curacy, though if a large bias is present a substantial increase in pre- 
cision may result in a trivial increase in accuracy. 

For simplicity in presentation, we shall sometimes speak as if no bias 
is present, that is, as if precision and accuracy are identical. Thus the 
phrase “true difference between two treatments” will mean the true 
difference as recorded by the measuring device that was actually used. 
This will be identical with the natural concept of a “true difference” 
only if the measurements are not biased in favor of one of the treatments. 

* For a discussion of the difficult problem of measuring precision and accuracy, 
see Shewhart (2.8). 

2.2 Number of Replications 

Whatever the source of the experimental errors, replication of the 
experiment steadily decreases the error associated with the difference 
between the average results for two treatments, provided that pre- 
cautions (such as randomization) have been taken to ensure that one 
treatment is no more likely to be favored in any replicate than another, 
so that the errors affecting any treatment tend to cancel out as the 
number of replications is increased. The rate at which the experimental 
error is reduced is predictable from statistical theory. The basic quantity 
used to measure experimental errors is the error variance per experimental 
unit, which is defined as the expected value of the square 
of the error that affects the observation for a single experimental unit. 
The square root of this quantity is called the standard error per unit. If 
$\sigma^2$ is the error variance per unit and there are r replications, the error 
variance of the difference between the means for two treatments is 
$2\sigma^2/r$, and the corresponding standard error is $\sqrt{2}\,\sigma/\sqrt{r}$. This result 
is valid unless increased replication necessitates the use of less homo- 
geneous experimental material or of a less careful technique. 
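
The effect of replication on the standard error of a treatment difference can be tabulated directly from this formula. The following Python sketch does so; the function names are hypothetical, and the value of 12 for the standard error per unit (as a percent of the mean) is an assumed illustration.

```python
import math

def variance_of_difference(sigma, r):
    """Error variance of the difference of two treatment means: 2*sigma^2/r."""
    return 2.0 * sigma ** 2 / r

def standard_error_of_difference(sigma, r):
    """Standard error of the difference: sqrt(2)*sigma/sqrt(r)."""
    return math.sqrt(variance_of_difference(sigma, r))

# Assumed illustration: standard error per unit of 12% of the mean.
for r in (2, 4, 8, 16):
    print(r, round(standard_error_of_difference(12.0, r), 2))
```

Since the standard error falls only as the square root of r, the number of replications must be quadrupled to halve it.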

In succeeding sections some advice is given on the choice of the number 
of replications for an experiment. To present realistic advice is not 
easy. Often the size of an experiment is limited by lack of resources 
or by the conflicting claims of other experiments. However, even if it is 
realized that an experiment must fall short of the precision desired, it is a 
good practice to try to estimate the degree of precision that will be at- 
tained and to present this information as part of the proposal for the 
experiment. 

2.21 Number of Replications for Tests of Significance. Consider the 
difference between the effects of a pair of treatments, which might be a 
standard procedure and some new procedure which it is hoped will prove 
better than the standard. The precision desired in an experiment may 
be specified in either of two ways. We may specify the size of the true 
difference which the experiment is to detect by means of a test of sig- 
nificance, or we may specify how closely we wish to estimate the true dif- 
ference, by stating the width of the confidence interval that we would 
like to have for the true difference. 

Although the specification used is to some extent a matter of individual 
taste, the approach by means of a test of significance is helpful mainly 
in the initial stages of a line of research. The reasoning in this approach 
is roughly as follows. If it can be established that the new procedure is 
superior to the standard procedure by at least some stated amount, say 
20% of their mean, then we will have discovered a useful result. On 
the other hand, if the first experiment shows no significant difference 
between the two treatments, we will be discouraged from further investigation 
of the new treatment. Consequently, the first experiment 
must be large enough to ensure that if the true difference is 20% or more, 
it is highly probable that we will obtain a significant difference. Note 
that in this approach we do not insist that the first experiment give a 
precise estimate of the true difference, but merely that it give a signifi- 
cant result at the chosen level of significance. 

The probability of obtaining a significant result depends on the true 
standard error $\sigma$ per unit, the number of replications r, and the number 
of degrees of freedom n that the experiment provides for estimating the 
error variance. The exact calculation of the probability is rather complicated; 
tables have been given by Neyman et al. (2.1) and others. The 
argument used in the following example, though logically faulty, leads 
to an approximation that is good enough for most practical purposes. 

Example. The true standard error per unit is 12% of the mean of the 
experiment, and there are 8 replicates and 28 d.f. for the estimate of 
error variance. What is the probability of obtaining a significant result 
when the true difference is 20%? 

Let d be the observed difference and $\delta$ the true difference between the 
mean of the new treatment and that of the standard treatment. The 
true standard error of d is 

$$\sigma_d = \sqrt{\frac{2}{r}}\,\sigma = \sqrt{\frac{2}{8}}\,(12) = 6\%$$

In practice we do not know $\sigma$, and in its place we use the estimated 
standard error per unit s, obtained from the analysis of variance of the 
results. We will assume that s happens to be equal to $\sigma$. 

The test of significance of d is made by means of a Student t-test, with 
28 d.f. To find the value that d must attain in order to be significant, we 
multiply the standard error of d, 6%, by the significant value of t. 
Suppose that the test is made at the 5% significance level, and that it is 
a one-tailed test.* The 10% value of t in the standard tables, for 
28 d.f., is 1.701, so that if d is to be significant we must have 

$$d > t_1 \sigma_d = (6)(1.701) = 10.206$$

* A one-tailed test is used when it is known that the new treatment must be at 
least as good as the standard procedure. The difference d is declared significant 
only if d is sufficiently large and positive; if d is negative we ascribe the result to 
experimental errors, since we do not entertain the possibility that the new treatment 
could be inferior. A two-tailed test is made when we do not know which treatment 
is better; large values of d, whether positive or negative, are declared significant. 
All standard tables of the significance levels of t are computed for two-tailed tests. 
For a one-tailed test we read the value of t corresponding to twice the significance 
probability. 


The problem is to find the probability that this happens. 
The quantity 

$$\frac{d - \delta}{\sqrt{2/r}\; s} = \frac{d - 20}{6} = t_2$$

follows the t-distribution with 28 d.f. But it is evident that 

$$d > 10.206 \quad \text{if and only if} \quad t_2 = \frac{d - 20}{6} > \frac{10.206 - 20}{6} = -1.632$$


Hence the probability which we seek is the probability that a value of 
t, with 28 d.f., should exceed −1.632. In the table of t, the probability 
that a value lies outside the limits ±1.632 is about .11. The probability 
wanted is therefore [1 − (½)(0.11)], or about .945. It is highly 
likely that a significant difference would be found in this experiment. 
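
The arithmetic of this example can be checked numerically. The Python sketch below follows the same simplified argument (s treated as equal to the true standard error per unit) and evaluates the t-distribution by numerical integration instead of reading a table. The function names and the use of Simpson's rule are choices made for this illustration, not part of the original method.

```python
import math

def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

def t_cdf(x, df, steps=2000):
    """P(T <= x), by Simpson's rule on [0, |x|] plus symmetry about zero."""
    a = abs(x)
    if a == 0.0:
        return 0.5
    h = a / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x > 0 else 0.5 - area

def approximate_power(delta, sigma, r, df, t1):
    """Approximate probability of obtaining a significant result.

    Follows the simplification of the worked example: s is treated as
    equal to sigma, so d is significant when t2 = (d - delta)/se
    exceeds t1 - delta/se.
    """
    se = math.sqrt(2.0 / r) * sigma   # standard error of d (6 in the example)
    threshold = t1 - delta / se       # about -1.632 in the example
    return 1.0 - t_cdf(threshold, df)

# Worked example: delta = 20, sigma = 12, r = 8, 28 d.f., tabled t1 = 1.701.
power = approximate_power(delta=20, sigma=12, r=8, df=28, t1=1.701)
print(round(power, 3))
```

The result should agree closely with the value of about .945 found from the table; any small discrepancy reflects the rounding of the tabled probability to .11.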

The flaw in the argument is the assumption that the value of s can 
be assumed equal to $\sigma$, whereas s is actually a random variable. If the 
argument is carried through algebraically, it gives the following rule for 
calculating the approximate probability P of obtaining a significant result. 

Step 1. Find $t_2$ from the relation 

$$\delta = \sigma \sqrt{\frac{2}{r}}\,(t_1 + t_2) \tag{2.1}$$

where $\delta$ = true difference that it is desired to detect, 
      $\sigma$ = true standard error per unit, 
      r = number of replications, 
      $t_1$ = significant value of t in the test of significance (with care to 
              distinguish between one-tailed and two-tailed tests). 

Step 2. 

$$P = 1 - \tfrac{1}{2}\,p_2 \tag{2.2}$$

where $p_2$ is the probability corresponding to $t_2$ in the ordinary t-table. 
The degrees of freedom in $t_1$ and $t_2$ are those available for the estimate of 
error variance. 

TABLE 2.1 Number of replications required for a given probability of 
obtaining a significant result 

Upper figure: Test of significance at 5% level, probability .80 
Middle figure: Test of significance at 5% level, probability .90 
Lower figure: Test of significance at 1% level, probability .95 

One-tailed tests; two-tailed tests (continued) 

True standard error per unit ($\sigma$) as percent of the mean 

Notes. In constructing the table, it was assumed that the number of degrees of 
freedom for error is 3(r − 1); this would apply in a randomized blocks experiment 
with 4 treatments. 

No entries are given where more than 50 replications are required. 

The rule may be inverted in order to give the number of replications 
r required for a given probability P of obtaining a significant result. In 
this form we have 

$$r \geq 2\left(\frac{\sigma}{\delta}\right)^2 (t_1 + t_2)^2 \tag{2.3}$$

where $\delta$ = true difference that it is desired to detect, 
      $\sigma$ = true standard error per unit, 
      $t_1$ = significant value of t in the test of significance, 
      $t_2$ = value of t in the ordinary table corresponding to probability 2(1 − P). 


The use of this equation is slightly tricky, because the number of 
degrees of freedom in $t_1$ and $t_2$ depends on r. Trial and error may be used 
until the smallest r satisfying (2.3) is found. 

Example. In the previous example, how many replications are required 
in order to have a chance of 4 in 5 that a significant result will be 
obtained? Here we have $\delta$ = 20, $\sigma$ = 12, and we will suppose that the 
experiment has 4 treatments in randomized blocks, which gives n = 
3(r − 1) degrees of freedom for the estimate of error. In the t-table, 
$t_1$ is the value for probability .10, while $t_2$ is the value for probability 
2(1 − .80) or .40. 




In order to start the calculation some guess must be made about the 
value of n. It does not matter greatly if the guess is inaccurate; we will 
try n = 30. For this, 

$$t_1 = 1.697; \quad t_2 = .854; \quad \text{so that} \quad (t_1 + t_2) = 2.551$$

Hence 

$$r \geq 2\left(\frac{12}{20}\right)^2 (2.551)^2 = 4.7$$

Consequently our second trial value is r = 5. Since this provides only 
12 instead of 30 d.f., we must repeat the calculation to verify whether 
5 replications are sufficient. For 12 d.f. $(t_1 + t_2)$ is 2.655. Thus 

$$2\left(\frac{\sigma}{\delta}\right)^2 (t_1 + t_2)^2 = 2\left(\frac{12}{20}\right)^2 (2.655)^2 = 5.08$$

Since this is greater than 5, we conclude that 5 replications are not quite 
enough, and that 6 is the smallest number of replications that will 
satisfy the conditions. We can be sure without further computation 
that 6 will satisfy the conditions, because the right-hand side of the in- 
equality always decreases when we increase r . 
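
The trial-and-error search can also be left to a short program. The Python sketch below is one possible arrangement, not the procedure used to build the table: instead of starting from a guess at n, it simply tries r = 2, 3, ... in turn and stops at the first value satisfying the inequality. The function names, the numerical t-quantile (Simpson's rule plus bisection), and the default of 4 treatments in randomized blocks are assumptions of this illustration.

```python
import math

def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))

def t_cdf_nonneg(x, df):
    """P(T <= x) for x >= 0 only, by Simpson's rule with step about 0.01."""
    if x <= 0.0:
        return 0.5
    steps = 2 * max(1, math.ceil(x / 0.02))  # even number of intervals
    h = x / steps
    total = t_pdf(0.0, df) + t_pdf(x, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return 0.5 + total * h / 3

def t_two_tailed(prob, df):
    """Tabled value t with P(|T| > t) = prob, as in a two-tailed t-table."""
    target = 1.0 - prob / 2.0
    lo, hi = 0.0, 1.0
    while t_cdf_nonneg(hi, df) < target:  # expand until the root is bracketed
        hi *= 2.0
    for _ in range(50):                   # bisection
        mid = (lo + hi) / 2.0
        if t_cdf_nonneg(mid, df) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

def replications_needed(delta, sigma, power, alpha, treatments=4,
                        one_tailed=True, max_r=50):
    """Smallest r with r >= 2 (sigma/delta)^2 (t1 + t2)^2.

    Error d.f. are taken as (treatments - 1)(r - 1), as in randomized
    blocks. Assumes power > 0.5. Returns None if more than max_r
    replications would be needed.
    """
    for r in range(2, max_r + 1):
        df = (treatments - 1) * (r - 1)
        t1 = t_two_tailed(2 * alpha if one_tailed else alpha, df)
        t2 = t_two_tailed(2 * (1 - power), df)
        if r >= 2 * (sigma / delta) ** 2 * (t1 + t2) ** 2:
            return r
    return None

# The example in the text: delta = 20, sigma = 12, P = .80, one-tailed 5% test.
print(replications_needed(delta=20, sigma=12, power=0.80, alpha=0.05))
```

For the example this search should stop at the same answer as the hand calculation, since the right-hand side of the inequality decreases as r increases and the first r that satisfies it is therefore the smallest.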

Table 2.1, computed by this method, shows the minimum number of 
replications for a range of values of $\delta$ and $\sigma$, and is intended to be used 
in the planning of experiments. Usually it is most convenient to think of 
$\delta$ and $\sigma$ expressed in percentages of the mean per observation for the 
experiment, and the range in the table has been constructed with this 
in mind. Since the value of r depends only on the ratio $\sigma/\delta$, it does not 
matter what units are used for $\delta$ and $\sigma$, provided that they are the same 
for both. There are three entries for any given pair of values of $\delta$ and $\sigma$. 
These show, in descending order, the replications required for a test of 
significance at the 5% level and a probability .80 of getting a significant 
result, for a 5% level and probability .90, and for a 1% level and prob- 
ability .95. The top half of the table applies to one-tailed tests, the 
lower half to two-tailed tests. 

Construction of the table requires some assumption about the size of 
the experiment. In the case presented the experiment has four treatments 
in randomized blocks, so that n = 3(r − 1). The values in the 
table change little if the experiment happens to provide more than this. 
If the experiment has only two or three treatments it is advisable to 
check the value of r by means of the formula. 

In the practical use of the table it is necessary to estimate the value of 
the true standard error per unit. Often this can be done fairly well from 
the results of previous experiments with the same kind of material. It 
should be realized that the probabilities to which the table applies will 
be correct only if the value of $\sigma$ is estimated correctly. If $\sigma$ is underestimated, 
the probability of obtaining a significant result will of course 
be smaller than the presumed value. For cases where it is highly im- 
portant that the probability be correct, more exact methods are dis- 
cussed in section 2.23. 

As an illustration of the reading of the table, consider the case in the 
last example, where $\delta$ = 20, $\sigma$ = 12, P = .80, and the test is a one-tailed 
test at the 5% level. For this we read the top entry in the top half of 
the table for $\delta$ = 20, $\sigma$ = 12, and find that 6 replications are necessary. 
From the middle figure we see that an increase to 7 replications in- 
creases the probability to .90. If the test is to be two-tailed, we read at 
the corresponding place in the lower half of the table, and find that 7 
replications are required for a probability .80 and 9 for a probability .90. 

The table brings out the value of any reduction in the standard error 
per unit. In fact, we cannot have a high probability of detecting a 5% 
difference with any reasonable number of replications unless the stand- 
ard error per unit is 4% or under. Differences of 20% or over can be 
detected in most cases without excessive replication. No entries have 
been inserted in the table for situations where more than 50 replications 
would be required, since experiments of this size are uncommon. 


2.22 Number of Replications for Prescribed Limits of Error. The 
confidence limits for δ, the true difference between the effects of two
treatments, are

$$\delta = d \pm \frac{\sqrt{2}\, s\, t}{\sqrt{r}} \qquad (2.4)$$


where t is the value in the t-table corresponding to the confidence
probability chosen and to the number of degrees of freedom n in the estimate
of error. The quantity √2 st/√r may be called the limit of error, since
it measures the maximum distance between d and δ for the confidence
probability chosen. Since the limit of error depends on s, it is not known
until the experiment is completed. If s is replaced by its average or 
expected value, we obtain the expected limit of error

$$L = \frac{\sqrt{2}\, \sigma f t}{\sqrt{r}} \qquad (2.5)$$

The factor f enters because the average value of s is slightly less than σ;
to a close approximation, f equals 1 − 1/(4n).

From relation (2.5) table 2.2 was constructed to show the numbers of
replications required for various values of L and for confidence
probabilities of .80, .90, and .95. As in table 2.1, the experiment was assumed to
have 3(r − 1) degrees of freedom in the estimate of error; no entries were
inserted where more than 50 replications are necessary, and the values
of L and σ are expressed as percentages of the mean for the experiment.

TABLE 2.2 Number of replications required for given limits of error in
the difference between two treatments

Upper figure: Confidence probability .80
Middle figure: Confidence probability .90
Lower figure: Confidence probability .95

Rows: limit of error (±L) as percent of the mean
Columns: true standard error per unit (σ)
[Table body not reproduced.]

This table may be used in the planning of an experiment where the
object is to estimate the difference in the effects of two treatments. Sup- 
pose that the true standard error per unit is expected to be about 6%, 
and that preliminary experiments have indicated that the true difference 
between the effects of the two treatments is around 25%. In this case 
we would probably not be satisfied with a value of L any higher than 
5%; if L is 5% and if the observed difference is found to be 22%, we 
are confident that the true difference lies within (22 ± 5)%, that is,
between 17 and 27%. From the entry for σ = 6, L = 5, we find that 6
replications suffice for a confidence probability of 80%, 9 for 90%, and 
12 for 95%. 
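
These entries can be reproduced from the expected limit of error. The sketch below reads relation (2.5) as L = √2 σ f t/√r with f = 1 − 1/(4n) and n = 3(r − 1), and searches for the smallest r whose expected limit does not exceed the chosen L. The helper names are illustrative, and the t quantiles come from a simple numerical integration rather than a table.

```python
import math

def t_cdf(x, n, steps=400):
    """P(T <= x) for Student's t with n d.f. (x >= 0), by Simpson's rule."""
    log_c = (math.lgamma((n + 1) / 2) - math.lgamma(n / 2)
             - 0.5 * math.log(n * math.pi))

    def pdf(u):
        return math.exp(log_c - (n + 1) / 2 * math.log1p(u * u / n))

    h = x / steps
    total = pdf(0.0) + pdf(x)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * pdf(i * h)
    return 0.5 + total * h / 3

def t_quantile(p, n):
    """The t with P(T <= t) = p, for 0.5 <= p < 1, by bisection on [0, 50]."""
    lo, hi = 0.0, 50.0
    for _ in range(40):
        mid = (lo + hi) / 2
        if t_cdf(mid, n) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def replications_for_limit(L, sigma, confidence, treatments=4, max_r=50):
    """Smallest r whose expected limit sqrt(2)*sigma*f*t/sqrt(r) is <= L."""
    for r in range(2, max_r + 1):
        n = (treatments - 1) * (r - 1)      # error d.f. in randomized blocks
        f = 1 - 1 / (4 * n)                 # average value of s divided by sigma
        t = t_quantile(1 - (1 - confidence) / 2, n)   # two-sided limits
        if math.sqrt(2) * sigma * f * t / math.sqrt(r) <= L:
            return r
    return None                             # more than max_r replications

for confidence in (0.80, 0.90, 0.95):
    print(confidence, replications_for_limit(5, 6, confidence))
```

With σ = 6 and L = 5 this gives 6, 9, and 12 replications for confidence probabilities .80, .90, and .95.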

It is important to reiterate that this table does not guarantee that the 
limits of error will be of the desired size, even if the value of σ is
forecast correctly. The actual limits will depend on s and may be either
larger or smaller than L. They will be about the desired size on the
average if σ is guessed correctly. Further, the table makes it clear that
small treatment differences of the order of 5% cannot be accurately
estimated in a single experiment, at least with the values of σ and r that
normally prevail in many lines of research. For an accurate estimate of
a 5% true difference, we would probably not want L any larger than 1%.
This value was not included in the table since it demands great numbers
of replications.

2.23 The Case Where Additional Assurance Is Desired. We have 
seen that tables 2.1 and 2.2 do not ensure that the desired standards of 
accuracy will always be attained. In table 2.1 the probability of
obtaining a significant result is as stated only if the value of σ is correctly
guessed. In table 2.2, if σ is correctly guessed, the limits of error will
equal the desired L on the average, but for a single experiment may be 
either higher or lower. In certain circumstances, methods are available 
which provide a more definite assurance. 

If an initial estimate s₁² of σ², based on n₁ degrees of freedom, has been
obtained from preliminary data, Harris, Horvitz, and Mood (2.3) present
tables that give the number of replications required so as to be certain
that the probability of obtaining a significant result will have the
desired value. No guesswork about σ is involved, though if n₁ is small the
number of replications required tends to be large so as to allow for the
possibility that s₁² is a very poor estimate of σ². With the same
preliminary data a value of r may be found such that the probability is
.05 that the limits of error exceed L. If in default of an estimate s₁² we
can give a range of values within which σ² is likely to lie (e.g., from
previous experiments), the authors suggest a simple method for using their
tables. 

An alternative approach, due to Stein (2.4), is applicable when the ex- 
periment can be interrupted after a given number of replications. This 
approach ensures that the limits of error will not exceed L. The pro- 
cedure is as follows. We first complete a number of replications some- 
what less than the number which we expect will be necessary. At this 
stage we compute the limits of error. If these are already less than L, 
the object is achieved. If they are greater than L, Stein gives a simple 
rule for computing the additional number of replications needed to give 
limits that are less than L. 

Naturally, these methods require more replications than tables 2.1
and 2.2. In fact, if used repeatedly, they give a standard of accuracy 
which on the average is greater than we desire, in order to ensure that 
the standard is seldom if ever less than what is wanted. For this rea- 
son, tables 2.1 and 2.2 appear to be suitable for ordinary experimenta- 
tion; the methods described here should be used when it is very impor- 
tant to reach a specified standard of accuracy. 

2.3 Other Methods for Increasing Accuracy

2.31 The Measurement of Relative Efficiency. We have seen that
the error variance of the difference between the means for two
treatments is 2σ²/r, where r is the number of replications. This result
provides a simple means for comparing, in concrete terms, the relative
precision of two experiments. Suppose that one design gives a true error 
variance of 1.0 per experimental unit, while a second gives 0.5. The 
two experiments would give equally precise comparisons among the 
treatment means if the amount of replication in the first were twice that 
in the second. From a knowledge of the error variances per unit,
comparisons are thus easily made in terms of the relative amounts of
replication required to attain the same degree of precision. The inverse ratio
of the variances per unit is sometimes called the relative efficiency of the
two designs; in the example above, the relative efficiency of the second
experiment to the first is 2.0 or 200%. Similarly we might speak of the
relative gain in efficiency as 1.0 or 100% in this case. These terms occur 
frequently in the literature on experimental designs. 
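
The arithmetic of this example can be written out as follows. The subscripts 1 and 2, and the symbols r₁ and r₂ for the numbers of replications in the two designs, are introduced here only for illustration.

```latex
\begin{align*}
\operatorname{Var}(d_1) &= \frac{2\sigma_1^2}{r_1} = \frac{2(1.0)}{r_1},
\qquad
\operatorname{Var}(d_2) = \frac{2\sigma_2^2}{r_2} = \frac{2(0.5)}{r_2}, \\
\operatorname{Var}(d_1) = \operatorname{Var}(d_2)
&\;\Longrightarrow\;
\frac{r_1}{r_2} = \frac{\sigma_1^2}{\sigma_2^2} = \frac{1.0}{0.5} = 2, \\
\text{relative efficiency of design 2 to design 1}
&= \frac{\sigma_1^2}{\sigma_2^2} = 2.0 = 200\%, \\
\text{relative gain in efficiency}
&= 2.0 - 1 = 1.0 = 100\%.
\end{align*}
```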

One further factor must be taken into account. For the same numbers 
of treatments and replicates, the number of degrees of freedom in the 
estimate of error changes with the design. This number enters into many 
of the uses to which the experimental results are put. When the number 
of degrees of freedom becomes smaller, the limits of error for a true
difference are increased and the probability of obtaining a significant
result is decreased; in other words, the sensitivity of the experiment is
decreased.

Thus the numbers of degrees of freedom for error are relevant to the 
comparison of two designs. A change in design which decreases the error 
degrees of freedom as well as the error variance may not be advantageous. 
This leads to the question: by how much must the experimental error 
variance be reduced so as to balance a given reduction in the error degrees 
of freedom? The question has been discussed by Neyman et al. (2.1),
Fisher (2.5), and Walsh (2.6). The answer depends on the use to which 
the results are put. From one point of view, two experiments may be 
regarded as equal in sensitivity when the limits of error for a given con- 
fidence probability are equal. Alternatively, equal sensitivity may 
imply an equal probability of detecting as significant a given real dif- 
ference between the effects of two treatments. Other definitions could 
be set up. 

Table 2.3 illustrates the situation. An experiment with a true error
variance per unit equal to 1 (i.e., with an infinite number of degrees of
freedom) was chosen as the standard. The table shows, for a range of
values of the error degrees of freedom (n), the true error variances per
unit which are required to equal the standard in sensitivity according
to four definitions of this term. In case 1 the average limits of error for
a confidence probability .95 are made equal. In case 2 the same
criterion is used for a confidence probability .80. In case 3 all experiments
have the same probability .25 of detecting as significant a constant
difference between two treatments. Case 4 shows the same comparison for
a larger constant difference where the probability is .75. The tests of
significance are one-tailed t-tests at the 5% level.

TABLE 2.3 Effect of number of degrees of freedom in error on the
sensitivity of the experiment. Error variances per unit which give equal
sensitivity according to four definitions

Columns: number of error degrees of freedom (n)
[Table body not recoverable; surviving fragment: 4, 0.40, 0.67, 0.91, 0.93]

For example the figure 0.70 for n = 6 (definition 1) means that with 
a true error variance per unit of 0.70 and 6 d.f. in the estimate of error, 
the limits of error for a confidence probability .95 are the same on the 
average as when σ² is 1 and n is infinite. A 30% decrease in error
variance is needed to compensate for the lack of degrees of freedom.
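
The figure 0.70 can be checked from the expected limit of error. For equal replication the expected limit is proportional to σft, so the sketch below equates σft for n degrees of freedom with the value of t for infinite degrees of freedom (the normal deviate, with f = 1 and σ = 1), taking f = 1 − 1/(4n) as an approximation. The function names are illustrative.

```python
import math
from statistics import NormalDist

def t_cdf(x, n, steps=400):
    """P(T <= x) for Student's t with n d.f. (x >= 0), by Simpson's rule."""
    log_c = (math.lgamma((n + 1) / 2) - math.lgamma(n / 2)
             - 0.5 * math.log(n * math.pi))

    def pdf(u):
        return math.exp(log_c - (n + 1) / 2 * math.log1p(u * u / n))

    h = x / steps
    total = pdf(0.0) + pdf(x)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * pdf(i * h)
    return 0.5 + total * h / 3

def t_quantile(p, n):
    """The t with P(T <= t) = p, for 0.5 <= p < 1, by bisection on [0, 50]."""
    lo, hi = 0.0, 50.0
    for _ in range(40):
        mid = (lo + hi) / 2
        if t_cdf(mid, n) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def equal_limit_variance(n, confidence):
    """Error variance that, with n d.f., gives the same expected limit of
    error as variance 1 with infinite d.f."""
    p = 1 - (1 - confidence) / 2
    z = NormalDist().inv_cdf(p)     # value of t when n is infinite
    f = 1 - 1 / (4 * n)             # approximate average of s / sigma
    return (z / (f * t_quantile(p, n))) ** 2

print(round(equal_limit_variance(6, 0.95), 2))
```

For n = 6 and a confidence probability of .95 the result is about 0.70.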

The wide divergence between the results for definition 1 and
definition 2 is noteworthy since the only change in criterion is that from a
confidence probability .95 to one of .80. Definitions 3 and 4 exhibit 
much closer mutual agreement and give results intermediate between 
those of definitions 1 and 2. In practical applications the discrepancies 
among the four methods will be smaller than table 2.3 suggests, be- 
cause comparable designs for the same experiment seldom differ greatly 
in the degrees of freedom available for error. 

Fisher’s approach (2.5) is somewhat different. In effect, he calculates 
the “amount of information” which the estimated difference d between
two treatment means supplies about the true difference δ, the
calculation being made from the t-distribution of the quantity

$$\frac{\sqrt{r}\,(d - \delta)}{\sqrt{2}\, s}$$

He finds the information to be (n + 1)/[(n + 3)s²], whereas if σ were
known exactly the information would be 1/σ². The estimated variances
which give equal amounts of information are shown in table 2.4.

TABLE 2.4 Estimated variances which provide equal amounts of
information

Columns: number of error degrees of freedom (n)
[Table body not reproduced.]

This table agrees closely with the results for cases 3 and 4, except
when n is below 5. For general purposes it is suggested that this table
be used to take account of the differences in degrees of freedom for error
in two designs that are being compared. Suppose that a design with
n = 6 is compared with one in which n = 12. The former gives more
information only if the error variance s₁² is less than (0.778/0.867)s₂².
Thus the relative efficiency of the first design to the second is estimated
as 0.778s₂²/0.867s₁², or more generally as

$$\frac{(n_1 + 1)(n_2 + 3)\, s_2^2}{(n_2 + 1)(n_1 + 3)\, s_1^2}$$

The adjustment is of importance only if n₁ and n₂ are small.
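
A minimal sketch of this adjustment follows. It uses Fisher's factor (n + 1)/(n + 3) as quoted above; the function names and the variances in the last example are hypothetical.

```python
def information_factor(n):
    """Fisher's factor (n + 1)/(n + 3) for n error degrees of freedom."""
    return (n + 1) / (n + 3)

def relative_efficiency(n1, s1_sq, n2, s2_sq):
    """Efficiency of design 1 relative to design 2, adjusted for error d.f."""
    return (information_factor(n1) / s1_sq) / (information_factor(n2) / s2_sq)

# Factors for the two designs discussed in the text (n = 6 and n = 12).
print(round(information_factor(6), 3), round(information_factor(12), 3))

# Hypothetical estimated variances: design 1 (n = 6) has the smaller variance.
print(round(relative_efficiency(6, 0.80, 12, 1.00), 3))
```

A value above 1 from `relative_efficiency` indicates that the first design supplies more information than the second after allowance for its fewer degrees of freedom.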

This expression applies to the simple case in which the two experi- 
ments have the same set of treatments. The experiments may, however, 
have different sets of treatments, although each provides an estimate of 
some specific treatment effect in which we are interested. Alternatively, 
a different method of estimation of the treatment effect may have been 
employed in the two experiments. In such cases we may wish to assess
the relative accuracy with which this treatment effect is estimated in
the two experiments. The expression given above still applies, except
that s₁² and s₂² become the estimated variances of the treatment effect
as found in the two experiments, while n₁ and n₂ are as before the
numbers of degrees of freedom in s₁² and s₂² respectively.

In succeeding sections a number of other devices which affect the 
accuracy of an experiment are presented briefly. 

2.32 Selection of Treatments. In certain cases the selection of the 
treatments has a substantial effect on the precision of an experiment.
As Fisher has stressed, striking gains in precision may be achieved by 
testing different types of treatment in the same experiment, instead of 
conducting a separate experiment for each type. For instance, one 
type of treatment or factor might be the depth of ploughing in the 
preparation of a field for a crop, while another might be the addition of a 
nitrogenous fertilizer. We might conduct one experiment in which 
deep (D) and shallow (S) ploughing are compared, and another in which 
no fertilizing (0) is compared with the addition of a specified amount of 
fertilizer (N). In a factorial experiment both factors would be tested
simultaneously by means of the 4 “treatments”: (DO), (SO), (DN), and
(SN), where the symbol (DO) implies deep ploughing with no addition 
of nitrogen, etc. The average response to nitrogen is assessed by com- 
paring the last two treatments with the first two. This comparison 
gives as precise an estimate of the average response to nitrogen as if the 
whole experiment had been devoted to that factor alone. The same 
property holds for the estimation of the average difference in effect be- 
tween deep and shallow ploughing. Thus the experiment, as it were, 
does double duty. The use of factorial experimentation is discussed 
more fully in chapter 5. 
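
The claim that the factorial experiment does double duty can be verified with a short calculation. The sketch assumes r replicates of each of the four treatment combinations (4r units in all) and the same error variance σ² per unit in both kinds of experiment; the symbol N̂ for the estimated average response to nitrogen is introduced here for illustration, and the treatment symbols stand for the corresponding treatment means.

```latex
\begin{align*}
\hat{N} &= \tfrac{1}{2}\bigl[(DN) + (SN)\bigr] - \tfrac{1}{2}\bigl[(DO) + (SO)\bigr], \\
\operatorname{Var}(\hat{N}) &= 4 \cdot \tfrac{1}{4} \cdot \frac{\sigma^2}{r} = \frac{\sigma^2}{r}
\qquad \text{(factorial experiment, $4r$ units)}, \\
\operatorname{Var}(\bar{y}_N - \bar{y}_O) &= \frac{\sigma^2}{2r} + \frac{\sigma^2}{2r} = \frac{\sigma^2}{r}
\qquad \text{(nitrogen alone, $2r$ units per treatment)}.
\end{align*}
```

The two variances agree, and the same argument applies to the comparison of deep with shallow ploughing.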

There are many specialized problems where the choice of the proper
amounts of some ingredient is important. As a very simple example,
suppose that the response to increasing amounts of the ingredient is
known to be linear and that the purpose is to determine the slope of the
line. The most accurate experiment contains only two amounts of
ingredient. These should be placed at the ends of the range within which
the response is linear and experimentation is feasible. With three
amounts of ingredient, two at the ends and one in the middle, the
variance of the estimated slope, for the same total size of experiment, is
1½ times as large, and if more amounts are used it becomes larger still.
A more complex example occurs in the estimation of the amount of a
virus in a solution by preparing a series of dilutions of the solution and 
counting the numbers of lesions produced on leaves which are rubbed 
with the dilutions (2.7). The accuracy of the estimate is known to 
depend both on the dilution ratio that is chosen and on the number of 
dilutions that are used. 
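
The comparison of two, three, and more amounts in the linear-response example can be checked directly, since the variance of a least-squares slope is σ² divided by the sum of squared deviations of the amounts from their mean. The sketch below assumes equally spaced amounts with the units divided equally among them; the function name and the total of 60 units are hypothetical.

```python
def slope_variance(levels, total_units, sigma_sq=1.0):
    """Variance of the least-squares slope with units split equally over levels."""
    per_level = total_units / len(levels)
    mean = sum(levels) / len(levels)
    sxx = per_level * sum((x - mean) ** 2 for x in levels)
    return sigma_sq / sxx

N = 60  # hypothetical total number of units, divisible by 2, 3, and 5
two = slope_variance([0.0, 1.0], N)
three = slope_variance([0.0, 0.5, 1.0], N)
five = slope_variance([0.0, 0.25, 0.5, 0.75, 1.0], N)

# Ratios of the slope variance to that of the two-amount experiment.
print(three / two, five / two)
```

The first ratio is 1.5, and the second is larger still.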

2.33 Refinements of Technique. Since technique is the responsibility 
of the experimenter, its importance need not be elaborated. The prin- 
cipal objectives of a good technique are as follows. 

i. To secure uniformity in the application of the treatments. In 
pig-feeding experiments, for instance, a uniform amount of food cannot 
be supplied to each animal without the provision of individual feeding
boxes. In the testing of insect sprays, delicate apparatus is required 
in order to subject each batch of insects to a desired dose of spray. 

ii. To exercise sufficient control over external influences so that 
every treatment produces its effects under comparable and desired
conditions. It is difficult to generalize about the degree of control needed;
a balance must be struck between the cost incurred and the gain in 
precision obtained. The artificial production of diseases for experiments 
on resistance to infection exemplifies a case where experimentation can- 
not proceed rapidly without such control over external conditions. 

iii. To devise suitable unbiased measures of the effects of the
treatments. Often the appropriate measurements are readily apparent;
sometimes the development of a satisfactory method of measurement 
requires years of research, as in the estimation of certain of the vitamins, 
in soil analysis, and in sociological investigations. 

iv. To prevent gross errors, from which no type of experimentation 
seems to be entirely free. Adequate supervision and checking of the 
work of assistants and a scrutiny by the experimenter of the data from 
every experimental unit will go far towards the discovery and rectifica- 
tion of errors. 

Faulty technique may increase the real experimental errors in two 
ways. It may introduce additional fluctuations of a more or less
random nature. Such fluctuations, if they are substantial, should reveal
themselves in the estimate of error as calculated in the analysis of 
variance (chapter 3). Where his estimated standard errors are con- 
sistently higher than those of other workers with similar material, the 
experimenter is advised to seek the reason, which may lie in differences 
in technique. In addition, faulty technique may result in measurements 
that are consistently biased. The estimate of error does not take ac- 
count of such biases, since it is derived from comparisons of the measure- 
ments with one another; in other words, it estimates precision rather 
than accuracy. The principal safeguards against such biases are care 
and skill in the construction and handling of measuring devices, plus the 
intelligent use of randomization. 

It is worth while to consider from time to time whether simplifications 
can be brought into the technique without undue loss of accuracy. 
Many experiments involve chemical determinations which vary little 
from unit to unit within the same treatment. For instance, in sugar- 
beet experiments, the experimental error of the amount of sugar per 
acre is due mainly to the error in the weight of roots per plot. The 
amount contributed by the sugar content percent is nearly negligible. 

With compound measurements of this type, the labor devoted to the 
less variable component can be reduced with little loss of accuracy. If 
the chemical measurement is expensive, a considerable saving in cost 
may be made. Tests of significance of the chemical composition need 
not be sacrificed. For example, if the experiment has six replicates, a 
chemical analysis might be made for each treatment on bulked material 
from the first three replicates and another on bulked material from the 
last three replicates. These data provide a valid test of significance based 
on two “replicates.” 

2.34 Selection of Experimental Material. The choice of the experi- 
mental unit may be of importance. In the planning of field experiments 
numerous studies have been made of the variability among crop yields 
on plots of various sizes and shapes under uniform treatment. From 
these data the best size and shape are selected. The criterion should 
be to obtain the maximum accuracy for a given expenditure of time and 
labor. 

Frequently, uniform material is prepared specially for experimental 
purposes, as in the development of inbred lines by laboratories engaged 
in animal experimentation. Alternatively, the experiment may be
confined to a sample chosen for its homogeneity from a large batch of
experimental material. If the results of the experiment are to be applied
to unselected material, these types of specialization have potential
disadvantages. Responses obtained to the treatments in highly selected
material may not be the same as the responses in unselected material.

Selectivity in the material is difficult to avoid where experiments are 
conducted in the field of economics or sociology. In testing some method 
of farm management, for instance, the experiment may require the active 
participation of a group of farmers. The success of such experiments 
depends greatly on the tact and resourcefulness of the investigator in 
persuading farmers to cooperate so that the participants are a represent- 
ative sample of the population about which generalizations are to be 
drawn. 

2.35 Additional Measurements. In the course of an experiment it 
may be possible to take supplementary measurements which predict at 
least to some extent the performances of the experimental units. In an 
experiment to measure the effects of different diets on the weights of 
children, their weights at the start of the experiment would be a
supplementary measurement of this kind, since the increase in weight of a
child during the experiment is probably correlated with his initial 
weight. 

By a technique known as the analysis of covariance, we may estimate 
from the data the extent to which the observations were influenced by 
the variations in these supplementary measurements. The average
response to each treatment can then be adjusted so as to remove the
experimental error arising from this source. In the school experiment,
the adjusted responses to treatments represent approximately the
responses that would be obtained if all the children had the same initial
weight. Thus the effects of variations in the initial weights are largely 
eliminated from the experimental error, without the necessity of equal- 
izing the initial weights in the planning of the experiment. 
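
The adjustment of treatment means can be sketched in a few lines. The example below is a simplified illustration, not the full computation: it assumes a single supplementary measurement x, units assigned to treatments without blocking, and a common regression of the response y on x within treatments. The data and names are hypothetical.

```python
def adjusted_means(groups):
    """groups: {treatment: [(x, y), ...]}; returns covariance-adjusted means of y."""
    all_x = [x for pairs in groups.values() for x, _ in pairs]
    grand_x = sum(all_x) / len(all_x)
    sxx = sxy = 0.0
    means = {}
    for name, pairs in groups.items():
        mx = sum(x for x, _ in pairs) / len(pairs)
        my = sum(y for _, y in pairs) / len(pairs)
        means[name] = (mx, my)
        sxx += sum((x - mx) ** 2 for x, _ in pairs)
        sxy += sum((x - mx) * (y - my) for x, y in pairs)
    b = sxy / sxx   # pooled within-treatment regression of y on x
    # Move each treatment mean to the value expected at the general mean of x.
    return {name: my - b * (mx - grand_x) for name, (mx, my) in means.items()}

# Hypothetical data: (initial weight, gain in weight) for children on two diets.
data = {
    "diet A": [(20, 12), (24, 14), (28, 16)],
    "diet B": [(26, 16), (30, 18), (34, 20)],
}
adj = adjusted_means(data)
print(adj)
```

In these invented data the children on diet B happened to be heavier at the start, so the unadjusted difference between the diets is 4 units, while the adjusted difference is 1 unit.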

Since the analysis is a statistical technique of comparatively recent 
origin (it was first presented by Fisher about 1932), it may be unfamiliar 
to many investigators who could use it to obtain a substantial increase 
in precision with little effort. Its purpose is to remove experimental 
errors arising from extraneous sources of variation which it is impractical
or too costly to control by a more refined technique. Cases are fairly 
common where the use of covariance has more than doubled the effi- 
ciency of the experiment. The extra work involved consists in taking 
the subsidiary measurements and in applying the adjustments to the 
treatment means. An example of the computations is given in section
3.8. 


2.36 Planned Grouping: Complete Blocks. Finally, we may attempt
to minimize the experimental errors through the choice of the
experimental plan. The prospects of increased accuracy by this means have
been widely explored during the past twenty years. This book is in- 
tended as a source of reference to the numerous procedures that have 
been developed. 

The basic idea is simple. Consider an experiment of which a number 
of separate replications have been conducted. The experimental errors 
of the results from any replicate can arise only from sources of variation 
that affect the units within that replicate. Consequently, the error of 
the difference between two treatment means taken over a number of 
replicates must also arise solely from variations within the individual 
replications. Variations from one replicate to another do not contribute 
to the errors. In carrying out an experiment we utilize this simple fact 
by trying to control sources of variation that affect different units in the 
same replicate. We need not attempt to reduce differences among the 
replicates. For example, if the experimental units form a very heter- 
ogeneous batch, we attempt to group them so that units in the same 
replicate are similar; we do not worry if the units in one replicate are 
not at all similar to those in another. By this device, precise experi- 
ments can often be made from what at first sight appears an unpromising 
batch of material. Similarly, if a uniform experimental technique cannot 
be maintained throughout the experiment, the important point is to 
keep the technique uniform within a replication; changes should be 
made when moving from one replicate to another. This type of design,
known in agriculture as randomized blocks, is discussed in section 4.2.
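
The statement that variations from one replicate to another do not contribute to the errors of treatment comparisons can be illustrated numerically. In the sketch below, which uses invented figures, an arbitrary change in level is added to every unit of a replicate; the differences among the treatment means are unchanged.

```python
def treatment_means(yields):
    """yields[b][t] is the observation for treatment t in replicate b."""
    reps = len(yields)
    return [sum(row[t] for row in yields) / reps for t in range(len(yields[0]))]

# Hypothetical results: 3 replicates (rows) of 4 treatments (columns).
yields = [
    [10, 12, 15, 11],
    [14, 15, 19, 16],
    [9, 12, 13, 10],
]
block_shift = [0, 25, -8]   # arbitrary changes in level between replicates
shifted = [[y + s for y in row] for row, s in zip(yields, block_shift)]

before = treatment_means(yields)
after = treatment_means(shifted)

# Differences of each treatment mean from the first treatment mean.
print([round(m - before[0], 6) for m in before])
print([round(m - after[0], 6) for m in after])
```

Both lines of output are the same, since each replicate contributes equally to every treatment mean.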

The idea is carried a stage further in the latin square (section 4.3). 
The treatments are arranged diagrammatically so that each appears once 
in every row and once in every column of a square array. Variations 
among the groups of experimental units which correspond to the rows 
and also among those which correspond to the columns are eliminated 
from the experimental errors. For example, in a comparison of 5
different ways of performing some manual operation, the rows may
represent 5 operators, each of whom carries out all the methods in turn. The
columns, numbered from left to right, may prescribe the order in which 
each operator performs the 5 tasks. The plan of a latin square design 
for this experiment might read as follows. 

                Order of procedure

Operator    (1)  (2)  (3)  (4)  (5)

   I         D    A    B    E    C
   II        A    B    C    D    E
   III       B    E    A    C    D
   IV        C    D    E    B    A
   V         E    C    D    A    B

The letters A, B, C, D, E represent the 5 methods. Thus operator I
carries out method D first, method A next, and so on. With this
arrangement, the experimental errors are unaffected by differences among
the abilities of the men and also by systematic variations introduced by 
the order in which the methods are performed. If the opportunity 
presents itself, this type of control may include a third factor by the use 
of a graeco-latin square (section 4.5). 
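
The defining property of the latin square, that each method appears once in every row and once in every column, is easily checked by machine. The sketch below does so for a 5 × 5 plan of the kind shown above, with rows for operators and columns for order of procedure; the function name is illustrative.

```python
# Rows are operators I to V; positions within a row give the order of procedure.
square = {
    "I": "DABEC",
    "II": "ABCDE",
    "III": "BEACD",
    "IV": "CDEBA",
    "V": "ECDAB",
}

def is_latin_square(rows):
    """True if every symbol occurs exactly once in each row and each column."""
    rows = list(rows)
    size = len(rows)
    symbols = set(rows[0])
    if len(symbols) != size or any(len(r) != size for r in rows):
        return False
    rows_ok = all(set(r) == symbols for r in rows)
    cols_ok = all({r[c] for r in rows} == symbols for c in range(size))
    return rows_ok and cols_ok

print(is_latin_square(square.values()))
```

A check of this kind is a useful safeguard when a plan is copied by hand, since a single misplaced letter destroys the balance on which the analysis depends.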

2.37 Planned Grouping: Incomplete Blocks. In the randomized block 
design, the precision depends to some extent on the number of treat- 
ments. As this number increases, it becomes more difficult to keep the 
variations between experimental units within each replicate small and 
the error variance per unit tends to increase. In experiments with a 
large number of treatments, numerous attempts have been made to 
avoid this loss of precision by the formation of groups of experimental 
units which do not contain all the treatments and therefore can remain 
small. These groups are called incomplete blocks. The groups are so 
constructed that we can still remove the effect of the differences among 
groups from the experimental errors, just as such differences are
automatically removed in the randomized blocks design. With incomplete
blocks, however, this removal usually requires the application of adjust- 
ments to the treatment means, so that the statistical analysis of the re- 
sults becomes more complex. 

The best method of constructing the incomplete blocks varies with the 
nature of the experiment. Sometimes certain comparisons among treat- 
ments are of greater interest than others. For instance, in a field experi- 
ment containing a number of early-maturing and an equal number of 
late-maturing varieties of a crop, comparisons between pairs of varieties 
which mature at about the same date will presumably be more important
than comparisons between an early and a late variety. An incomplete 
block might contain either all the early or all the late varieties, a pair of 
blocks forming a complete replication. This arrangement, of which 
several variations are possible, is known as the split-plot design (section 
7.1). 

A similar situation occurs in factorial experiments. Certain
comparisons, e.g., the average effects produced by varying one factor, are
nearly always of immediate interest. Others, which represent rather
complex relationships among the effects of different factors, may be of
minor importance, often because previous experiments have shown that
these interrelationships, called interactions, exert little or no influence.
It has been found that, by a deliberate sacrifice of precision in the
estimates of these interactions, the size of the group or block can be
reduced so as to produce an increase in the precision with which the average
effects of the factors are estimated. This principle, known as con- 
founding, can also be combined with the use of the latin square (chapters 
6 and 8). 

If all comparisons between pairs of treatments are potentially of equal 
importance, a different method is used in forming the blocks. When 
treatments are arranged in incomplete blocks, two treatments which oc- 
cur in the same block are more precisely compared than two which are 
placed in different blocks, at least in so far as the investigator has suc- 
ceeded in decreasing the variation within blocks. Therefore the goal is to 
construct a design such that any pair of treatments occurs equally often
within some block. This type of problem has been investigated pre- 
viously for its mathematical interest. A solution can be found for any 
number of treatments and any size of block, but most of the solutions 
require too many replications for the usual conditions of experimentation. 
The solutions available to date (called balanced incomplete block designs 
(chapter 11)) are given for all cases in which the number of replications 
does not exceed ten. Here again the latin square principle can some- 
times be used (chapter 13) to eliminate variation amongst two different 
types of grouping of the experimental units. 

For a given number of treatments and a given size of incomplete 
block, balanced designs allow little choice in the number of replications. 
Thus, with 64 treatments and blocks of 8 units each, 9 replications are
required to balance the design. In order to provide designs for small
numbers of replications, a number of additional types have been
developed; these are similar to balanced designs except that they lack
complete symmetry. 
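
The restriction on the number of replications follows from simple counting. If v treatments are arranged in b blocks of k units, with r replications and every pair of treatments together in λ blocks, then bk = vr and λ(v − 1) = r(k − 1), and both b and λ must be whole numbers. The sketch below finds the smallest r meeting these two conditions; they are necessary conditions only and do not by themselves guarantee that a design exists. The function name is illustrative.

```python
def balanced_parameters(v, k, max_r=50):
    """Smallest r (with b and lambda) meeting the counting conditions for balance."""
    for r in range(1, max_r + 1):
        blocks_whole = (v * r) % k == 0                 # b = v r / k
        pairs_whole = (r * (k - 1)) % (v - 1) == 0      # lambda = r (k-1)/(v-1)
        if blocks_whole and pairs_whole:
            return {"r": r, "b": v * r // k, "lambda": r * (k - 1) // (v - 1)}
    return None

print(balanced_parameters(64, 8))
```

For 64 treatments in blocks of 8 the smallest admissible number of replications is 9, with 72 blocks and each pair of treatments occurring together once.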

One group, called lattice designs (chapter 10), has been used widely 
in agricultural experiments during the past 5 years. For these designs, 
the number of treatments must be an exact square, e.g., 25 or 49, while 
the number of units in the block is the corresponding square root, 5 or 
7, respectively. A further group, known as cubic lattices (section 10.4), 
is useful when the number of treatments exceeds 100 yet it is desirable 
to have a small block. In this case the number of treatments is the cube 
of the number of units per block, and the number of replications may be 
any multiple of 3. 

2.38 Summary. There are numerous methods available for increasing 
the accuracy of an experiment. Frequently the same end may be 
reached in several different ways. Thus, in the experiment which tests 
the effects of a number of diets on the weights of children, the disturbing 
effects of variations in their initial weights can be reduced by selecting
for the experiment a number of children of approximately the same initial
weight, or by the use of the analysis of covariance, or by a planned 
grouping in which children within the same replicate are of approxi- 
mately equal initial weight, or simply by having a large number of 
replications. The method adopted should be that for which the desired 
standard of accuracy can be attained with the smallest expenditure of 
time and effort. There is no special merit in either a complicated
experimental plan or a highly refined technique if equally accurate results 
can be secured with less effort in some other way. A good working rule 
is to use the simplest experimental design that meets the needs of the 
occasion. This is not to say that the more complex designs will be used 
only rarely; in fact, they have already demonstrated their utility in 
fields of research where no other equally practicable method appears to 
exist for reaching the same results. 

2.4 The Grouping of Experimental Units 

2.41 Investigation of Methods for Grouping. In order to take full 
advantage of the opportunities for increased precision by suitable group- 
ing of the units, the investigator must know the best criteria for group- 
ing. Frequently these are suggested from previous observations on the 
experimental material or on disturbing factors that become evident in 
the course of an experiment. Where knowledge is less definite, as in a 
new line of research, there are two ways in which information may be 
accumulated. 

One is to devote an experiment specifically to this problem. In this 
case it is usually advisable to subject all units to the same treatment, 
carrying out a so-called uniformity trial. The word uniformity refers to 
uniformity of treatment; the experimental material and technique should 
be representative of those which are used in actual experiments. The 
amount of material should be sufficient to enable the experimenter to 
superimpose on the results a hypothetical trial of the size which he cus- 
tomarily performs. It is worth while to record any auxiliary measure- 
ments which might predict the performance of the experimental units. 
From the results of these trials any proposed grouping of the units can 
be formed and the amount of variation within and among the groups 
can be calculated. If it is desired to compare different experimental 
plans, each may be superimposed on the results of the uniformity trial. 
By the method indicated in section 2.31, the investigator may estimate 
the relative numbers of replications that are needed to reach the same 
degree of precision with the various plans. 
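
As a rough illustration of superimposing two proposed groupings on a uniformity trial, the sketch below computes the pooled variation within groups for each grouping. The yields are made-up numbers, not data from any real trial, and the helper name `within_group_mean_square` is hypothetical.

```python
# Hypothetical uniformity-trial yields for 12 plots lying along a strip
# (made-up numbers showing a fertility trend along the strip).
yields = [20, 21, 19, 22, 25, 26, 24, 27, 30, 31, 29, 32]


def within_group_mean_square(values, groups):
    """Pooled variance within groups: the sum of squared deviations from each
    group mean, divided by (number of plots - number of groups)."""
    ss = 0.0
    for group in groups:
        vals = [values[i] for i in group]
        mean = sum(vals) / len(vals)
        ss += sum((v - mean) ** 2 for v in vals)
    n_plots = sum(len(group) for group in groups)
    return ss / (n_plots - len(groups))


# Grouping A: blocks of four neighboring plots.
contiguous = [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9, 10, 11]]
# Grouping B: blocks made of plots scattered along the strip.
scattered = [[0, 3, 6, 9], [1, 4, 7, 10], [2, 5, 8, 11]]

ms_contiguous = within_group_mean_square(yields, contiguous)
ms_scattered = within_group_mean_square(yields, scattered)
ratio = ms_scattered / ms_contiguous
```

If one assumes that the replication needed for a given precision is proportional to the error variance, `ratio` gives a first idea of how many more replications the poorer grouping would require; the exact rule is the one given in section 2.31.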

Secondly, useful information can sometimes be secured without any 
special trial or technique by examination of the results of actual
experiments. With an experiment which is suitably designed, it is possible to 
estimate from an analysis of variance of the results what the error vari- 
ance would have been if any particular grouping had not been used. This 
estimate, being derived from the same data, is directly comparable with 
the error variance obtained by the use of the grouping and enables the 
investigator to evaluate the success of the grouping. Consequently, for 
experiments in a new line of research, the experimenter may try any 
method of grouping that might be effective, in the knowledge that its 
appropriateness will be revealed by the experiments themselves. In- 
structions for performing such comparisons, which are easily made when 
the analysis of variance has been completed, are given in the notes which 
accompany each type of design. 

2.42 Criteria for Grouping. Many factors have been used as the basis 
for grouping. In agricultural experimentation it has been repeatedly 
demonstrated that plots close together tend to be more similar in their 
yields than plots farther apart. The almost universal practice is to put 
neighboring plots in the same group, with the groups approximately 
square in shape whenever practicable. Contiguity frequently forms a 
good basis for grouping in greenhouse experiments also, where differences 
may exist along the bench in temperature, sunlight, air currents, or ac- 
cessibility for watering. Alternatively, in many plant experiments, 
and more particularly in animal experiments, some characteristic of the 
plant or animal is much more useful, the physical location of the ex- 
perimental units throughout the course of the trial having a relatively 
minor influence on the results. Age, weight, vigor, sex, and genetic con- 
stitution are some of the factors most commonly used. The animal it- 
self may constitute the block in cases where the treatments can be applied 
in succession to the same animal without producing residual effects which 
obscure the results. 

A single grouping, as in the randomized block design, can be used to 
eliminate simultaneously variation from a number of different sources. 
For example, suppose that an experiment were planned to investigate 
whether some treatment applied to rats enabled the animals to with- 
stand subjection to a dose of a poisonous gas. The replication would 
contain two animals, a treated rat and an untreated rat to be included 
as a control. Given a sufficient stock of experimental animals, the pairs 
in any replication might be animals of the same litter (and consequently 
of the same age), of the same sex, and of approximately the same weight 
and vigor. If it were convenient to test two animals at the same time, 
the pair could be put into the gas chamber together, thus eliminating 
also the effects of variations in the strength of the dose or in the time for 
which it was supplied. 

The freedom with which the size of block can be varied depends on 
the type of research. In field experiments the block can be laid down so 
as to accommodate any number of treatments, though, as previously 
stated, the standard error per plot may be expected to increase slowly as 
the size of block increases. When the use of designs with incomplete 
blocks is contemplated for field experiments, the principal question is to 
decide for what number of treatments the reduction in block size be- 
comes worth while. On the other hand, the number of leaves on a plant, 
or of animals of the same litter, usually varies within rather narrow 
limits. In such cases, where the most homogeneous grouping fixes the 
size of the block, incomplete block designs may be of special interest. 

The gains in efficiency obtainable by skillful construction of the 
blocks, when measured in terms of increased replication, are often large; 
examples can be cited in which the effective replication has been in- 
creased from two- to tenfold. Frequently, apart from the preliminary 
care and thought involved, an accurate design uses the same material 
and involves the same physical operations as the design which it replaced. 
In some of the more complex designs the computations required to es- 
timate the treatment averages and to perform tests of significance be- 
come rather laborious and involved. The experimenter's reaction to 
these difficulties will depend on his aptitude for numerical computations. 
At a first trial considerable time may be spent in mastering some of the 
methods, but the processes become easier with familiarity. 
CHAPTER 3 


NOTES ON THE STATISTICAL ANALYSIS OF THE RESULTS 

3.1 Introduction 

With regard to the statistical analysis for a design, the procedure 
adopted in this book is to present the computations, with worked nu- 
merical examples for the designs likely to be most frequently used. To 
supply a complete mathematical justification of the analysis for each 
type of experiment, although in many respects desirable, would have 
unduly increased the amount of material. The same general technique 
governs the analysis of all designs, differences arising only in the adapta- 
tion of the technique to the particular structure of a design. In the 
present chapter this technique (the method of least squares) is
described and applied to one of the simplest designs. With perseverance 
the method may be used to verify the computing instructions for other 
designs in this book. It should be pointed out that the computing meth- 
ods that are easiest in practice are not necessarily those that flow from a 
straightforward application of the theory, so that at first sight some of 
the instructions may appear different from those given by theory. 

Some account will also be given of certain statistical techniques, such 
as the technique for missing values and the analysis of covariance, that 
are particularly useful in handling the results of experiments, though 
they are perhaps too specialized to be included as part of a general in- 
troductory course in statistics. 

3.2 The General Method of Analysis 

3.21 The Mathematical Model. The illustrative data come from an 
experiment conducted in 1935 by the Rothamsted Experimental Station 
(3.1). The object of the experiment was to measure the effectiveness of 
4 soil fumigants in keeping down the numbers of eelworms in the soil. 
The fumigants were chlorodinitrobenzene (CN), carbon disulphide jelly 
(CS), and two proprietary preparations, “Cymag” (CM) and “Seekay” 
(CK). Each fumigant was tested both in a single and a double dose. 
These comprise 8 distinct treatments. The control (no fumigant) was 
the ninth treatment. Normally the replication would have consisted 
of 9 plots, but in this case 4 control plots were placed in each “replicate,” 
which actually contained 12 plots. The purpose was to supply a fairly 
accurate standard against which to measure the performance of the 
fumigants. 

The experiment was laid out in 4 separate blocks of 12 plots each, so 
that there were 4 replications for each dose of each fumigant and 16 
replications of the control. Within a block the 12 “treatments” were 
allotted to individual plots at random. The experiment is of the 
familiar type known in agriculture as randomized blocks (see chapter 4). 
The fumigants were ploughed in during spring, after which a crop of oats 
was sown. After harvest, a sample of 400 grams of soil was taken from 
each plot and the number of eelworm cysts counted for each sample. 
The data are those labelled “second count” in table 3.1. 


TABLE 3.1 Plan and numbers of eelworm cysts per 400 grams of 
soil, first count above, second below 

Block totals (first count, second count): 
Block 1: 2587, 4383      Block 2: 964, 3075 
Block 3: 1743, 4752      Block 4: 872, 2470 

                     Block 1                        Block 2 

Treatment     0    2CK    1CN    1CM   |   2CM    2CS    2CK      0 
First       269    283    252    212   |    95    127     80    134 
Second      466    286    398    380   |   199    166    142    590 

Treatment   1CS      0      0    2CM   |   1CK    1CN    1CM      0 
First       138    100    197    263   |   107     89     41     74 
Second      194    219    421    379   |   236    332    176    137 

Treatment   2CS    1CK      0    2CN   |     0      0    2CN    1CS 
First       282    230    216    145   |    88     25     42     62 
Second      372    256    708    304   |   356    212    308    221 

                     Block 3                        Block 4 

Treatment   1CK      0    1CS    2CK   |   2CK      0    1CK    1CM 
First       124    211    194    222   |   193    209    109    153 
Second      268    505    433    408   |   202    352    132    404 

Treatment     0    2CN    2CS    1CN   |     0    2CN    2CS      0 
First       102    193    128     42   |    29      9     17     19 
Second      363    561    311    222   |   254     92     28    106 

Treatment   2CM      0    1CM      0   |   1CS    1CN      0    2CM 
First       162    191    107     67   |    23     19     44     48 
Second      365    563    ...    338   |    80    ...    268    298 



In a formal analysis of the results the first step is to set up an
equation for every observation. This equation expresses the observation as 
the sum of four components: (i) a general average about which the 
observations are presumed to be fluctuating; (ii) a component
representing the effect of the treatment applied; (iii) a component represent- 
ing certain environmental effects which the design of the experiment 
enables us to isolate; and (iv) a residual component, representing all 
other sources that influence the observation, and generally referred to as 
the “experimental error.” 

Component (iii) cannot be fully appreciated without some knowledge 
of the technique employed in the construction of designs. This technique 
consists in arranging the conduct of the experiment so that, in the sub- 
sequent analysis of the results, differences among certain groups of ob- 
servations can be measured. What is more important, the effects of such 
differences can be eliminated from the estimates of the treatment effects. 
If this elimination were not feasible, these differences would contribute 
to the “experimental error,” component (iv), with the consequence that 
less accurate estimates of the treatment effects would be obtained. In 
the present example, as will be seen, these differences are differences 
among the groups of 12 observations that constitute a block. With 
each design that we describe, the environmental components that are 
eliminated in this way are pointed out. 

We proceed to write down the mathematical model as applied to the 
example. Instead of numbering the observations from 1 to 48, it is con- 
venient to use a subscript i to denote the treatment applied, j to denote 
the block in which the observation lies, and k to denote the order within 
the block. For the plots that receive fumigants, the subscripts i and j 
are sufficient to define each plot uniquely. The subscript k is needed to 
distinguish between different control plots within the same block. In 
this notation the equation for any observation may be written 

$$y_{ijk} = \mu + \tau_i + \beta_j + e_{ijk} \qquad (3.1)$$

where $\mu$ represents the general mean, $\tau_i$ the effect of the treatment, $\beta_j$ 
that of the block, and $e_{ijk}$ the experimental error. 

3.22 Assumptions Made in the Model. There is already implicit in 
the model one assumption that may not be true in the data obtained 
from experiments. This assumption is that the treatment effect and the 
block effect are additive. If we take the difference between two
observations in the same block, say $y_{1jk_1}$ and $y_{2jk_2}$, we have, as a deduction from 
the model, 

$$y_{1jk_1} - y_{2jk_2} = \tau_1 - \tau_2 + e_{1jk_1} - e_{2jk_2} \qquad (3.2)$$

The block effects have disappeared from the right-hand side; that is, 
the model implies that the difference between the true effects of two 
treatments is the same in all blocks. On a block with a high eelworm 
infestation a successful fumigant is presumed to reduce the eelworm 
numbers by the same amount as on a block with a low eelworm infesta- 
tion. This is unlikely to be so, except as a first approximation. Little 
systematic study of the accuracy of this approximation in practice has 
been made. From general experience it appears that the approximation 
works well in a large number of experiments; nevertheless, cases are not 
infrequent where the approximation is poor. 

In addition, several assumptions are made about the residual effects 
$e_{ijk}$. These are taken to be independent from observation to observation, 
and to be distributed with zero mean and the same variance $\sigma^2$. Further, 
for tests of significance and the estimation of confidence limits, the 
$e_{ijk}$ are assumed to follow the normal or Gaussian frequency distribution. 
In practice, these assumptions are only approximately fulfilled. The 
consequences of errors in the assumptions will be indicated later (section 
3.9). 

Apart from changes in detail, the type of model and assumptions 
presented for this example apply to all experimental arrangements
included in this book, except for two types of design for which a more 
complex model is required. These are the split-plot designs (chapter 7), 
where two or more different error variances are postulated, and in- 
complete block designs (chapter 9), where additional assumptions are made 
about the nature of the environmental effects. 

3.23 Estimation of the Treatment Effects. When the assumptions in 
the previous section are valid, there is a well-known result in statistical 
theory that the best estimates of the unknowns $\mu$, $\tau_i$, and $\beta_j$ are obtained 
by the method of least squares. This method chooses estimates $m$, $t_i$, and 
$b_j$, respectively, which minimize the sum of squares of the residuals 

$$\sum (y_{ijk} - m - t_i - b_j)^2 \qquad (3.3)$$

taken over all observations. 

In order to find these minimizing values we differentiate the sum of 
squares with respect to each unknown in turn, and set the derivative 
equal to zero. Consider a specific unknown, e.g., $t_i$. If any square
contains $t_i$, its derivative is 

$$-2(y_{ijk} - m - t_i - b_j)$$

If the square does not contain $t_i$, the derivative is zero. Hence, if the 
derivative of the sum of squares in (3.3) is set equal to zero, the resulting 
equation may be written 

$$\sum (y_{ijk} - m - t_i - b_j) = 0$$

or 

$$\sum (m + t_i + b_j) = \sum y_{ijk} \qquad (3.4)$$

where the sum is over all observations whose equations contain $t_i$, or 
more generally over all observations whose equations contain the
parameter to be estimated. 

Equations (3.4) are called the equations of estimation, or the normal 
equations. As we have seen, the equation of estimation for any
parameter is constructed by equating the total of the observed values to 
the total of the expected values $(m + t_i + b_j)$ over all observations whose 
equation contains the parameter. 

We now write down some normal equations for the example. Since 
$m$ occurs in all 48 observations, its normal equation is 

$$48m + 4(t_1 + t_2 + \cdots + t_8) + 16t_9 + 12(b_1 + b_2 + b_3 + b_4) = G \qquad (3.5)$$

where $t_9$ denotes the control treatment, and $G$ the grand total of all 48 
observations. At this point a simplification may be introduced. In 
the model the quantities $t_i$ are needed only to indicate by how much an 
individual treatment differs from the average of all treatments in the 
experiment. Accordingly the model is not changed in any essential 
feature if we assume that the mean of all the $t_i$ is zero. It may also be 
shown that any convenient linear function of the $t_i$ may be assumed to 
be zero. The same remarks apply to the block values $b_j$. From the form 
of equation (3.5) it appears that convenient assumptions of this kind are 

$$t_1 + t_2 + \cdots + t_8 + 4t_9 = 0 \qquad (3.6)$$

$$b_1 + b_2 + b_3 + b_4 = 0 \qquad (3.7)$$

With these assumptions equation (3.5) reduces to $48m = G$, so that 
$m$ becomes the mean value of all observations in the experiment. 

By summing over the observations that receive a particular
treatment $t_i$ we find that its normal equation is 

$$4m + 4t_i + b_1 + b_2 + b_3 + b_4 = T_i \qquad (i = 1, 2, \ldots, 8)$$

and for the control, 

$$16m + 16t_9 + 4(b_1 + b_2 + b_3 + b_4) = T_9$$

where the $T$'s denote observed treatment totals. From relation (3.7), 
these equations reduce to 

$$t_i = \frac{T_i}{4} - m \quad (i = 1, 2, \ldots, 8); \qquad t_9 = \frac{T_9}{16} - m \qquad (3.8)$$

As might be expected, the effect of any treatment is estimated by 
taking the difference between the mean of the observations which
receive that treatment and the general mean. 
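
Relation (3.8) can be applied directly to the second-count treatment totals of table 3.2. The sketch below is illustrative only; the variable names are invented here, and the 1CS total is taken as 928, the value that agrees with the row total 4317 of that table.

```python
# Second-count treatment totals (4 plots each), keyed by dose and fumigant.
treatment_totals = {
    "1CN": 1066, "1CS": 928, "1CM": 1431, "1CK": 892,
    "2CN": 1265, "2CS": 877, "2CM": 1241, "2CK": 1122,
}
control_total = 5858   # total of the 16 control plots (T_9)
grand_total = 14680    # G, total of all 48 observations

# General mean: equation (3.5) reduces to 48m = G.
m = grand_total / 48

# Relation (3.8): each effect is the treatment mean minus the general mean.
t = {name: total / 4 - m for name, total in treatment_totals.items()}
t_control = control_total / 16 - m

# The estimates should satisfy the linear relation (3.6).
constraint = sum(t.values()) + 4 * t_control
```

Under these figures the control effect `t_control` is positive and most fumigant effects are negative, and `constraint` vanishes apart from rounding error, as (3.6) requires.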
Let us express these estimates in terms of the original model (3.1). 
By adding the relevant equations we find 

$$T_i = 4\mu + 4\tau_i + (\beta_1 + \beta_2 + \beta_3 + \beta_4) + \sum e_{ijk}$$

$$T_9 = 16\mu + 16\tau_9 + 4(\beta_1 + \beta_2 + \beta_3 + \beta_4) + \sum e_{9jk}$$

Hence, using (3.8), 

$$m + t_i = \mu + \tau_i + \tfrac{1}{4}(\beta_1 + \beta_2 + \beta_3 + \beta_4) + \bar{e}_i$$

where $\bar{e}_i$ is the average of the residual errors on plots that receive
treatment $i$. For the difference between a pair of treatments we deduce 

$$t_i - t_u = (\tau_i - \tau_u) + (\bar{e}_i - \bar{e}_u) \qquad (3.9)$$

This shows that the estimated difference equals the true difference plus 
an error of estimate $(\bar{e}_i - \bar{e}_u)$. Note that the error of estimate does not 
contain any of the environmental (block) effects arising from
component (iii). This result exemplifies a valuable property of least squares 
solutions, namely, that the estimate of the difference between the
effects of any two treatments is not influenced by the environmental
effects, but only by the residual effects $e_{ijk}$. It is in this sense that we say 
that environmental effects are eliminated from the estimates of the
treatment effects. 

Estimates of the block or environmental effects are usually of less 
interest. It may be verified that the least squares estimate of $\beta_j$ is 

$$b_j = \frac{B_j}{12} - m$$

where $B_j$ is the observed block total. 
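
The block effects can be computed from the second-count block totals of table 3.1 in the same way as the treatment effects. This sketch assumes the estimate $b_j = B_j/12 - m$, which follows from the normal equation for a block together with relation (3.6); the variable names are illustrative.

```python
# Observed second-count block totals B_j (12 plots per block).
block_totals = {1: 4383, 2: 3075, 3: 4752, 4: 2470}

# General mean of all 48 observations.
m = sum(block_totals.values()) / 48

# Estimated block effects: block mean minus general mean.
b = {j: total / 12 - m for j, total in block_totals.items()}

# The estimates should satisfy the linear relation (3.7).
constraint = sum(b.values())
```

Blocks 1 and 3 come out above the general mean and blocks 2 and 4 below it, which reflects the uneven eelworm infestation that the blocking is meant to remove from the treatment comparisons.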

3.24 Tests of Significance and Confidence Limits. When the estimates 
of the treatment effects have been obtained, the details of the subsequent 
study of them vary with the type of experiment. Usually, however, 
some of the following procedures form the basis for the final conclusions: 
(i) a test of significance of the null hypothesis that the effects $\tau_i$ and 
$\tau_u$ of two treatments are identical, or, more generally, of the hypothesis 
that some linear combination of the $\tau_i$ is zero; (ii) the construction of 
confidence limits for the difference $(\tau_i - \tau_u)$ between the effects of two 
treatments, or for a linear function of the treatment effects; (iii) a test 
of significance of the null hypothesis that a group of treatments $\tau_1$, 
$\tau_2, \ldots, \tau_p$ all have identical effects. In this section the theory
governing these procedures is presented briefly, without proof. The
arithmetical methods are described in the next section. 

i. Test of significance of the difference between two treatment effects. 
From equation (3.9) we have 

$$t_i - t_u = (\tau_i - \tau_u) + (\bar{e}_i - \bar{e}_u) \qquad (3.9)$$

Since the errors are normally distributed with zero means, it follows by 
theory that $(t_i - t_u)$ is normally distributed about the true difference 
$(\tau_i - \tau_u)$. Further, the variance of $(t_i - t_u)$ may be shown to be 

$$\sigma^2 \left( \frac{1}{r_i} + \frac{1}{r_u} \right)$$

where $r_i$ and $r_u$ are the numbers of replications of $t_i$ and $t_u$, respectively. 
Since it is useful to be able to calculate the variance of any comparison 
in which we are interested, the rules for doing this are given in section 
3.5. 

In order to proceed we need an estimate of the experimental error 
variance $\sigma^2$. From theory, the best estimate is known to be 

$$s^2 = \frac{\sum (y_{ijk} - m - t_i - b_j)^2}{n_e}$$

where the residual sum of squares is taken over all observations. The 
divisor $n_e$, called the number of degrees of freedom (d.f.) in the estimated 
error, is given by the rule 

$n_e$ = (total number of observations) - (number of independent 
parameters that were estimated) 

The word independent is introduced because we nearly always include 
in the original model more parameters than are strictly necessary. In 
the example we found that we could assume one linear relation (3.6) 
among the $t$'s and one (3.7) among the $b$'s. Consequently the number of 
independent parameters is 

1 (for $m$) + 8 (for the $t$'s) + 3 (for the $b$'s) = 12 

Since there are 48 observations, $n_e$ is 36. 

Hence, if the null hypothesis is true, i.e., $\tau_i = \tau_u$, then $(t_i - t_u)$ is 
normally distributed with mean zero and estimated variance 

$$s^2 \left( \frac{1}{r_i} + \frac{1}{r_u} \right)$$

The ratio 

$$\frac{t_i - t_u}{s \sqrt{\dfrac{1}{r_i} + \dfrac{1}{r_u}}}$$

is known to follow Student's t-distribution with $n_e$, or 36, d.f. This is the 
quantity used for a test of significance of the null hypothesis. 

With a more complicated function of the estimated treatment effects, 
say $L = (w_1 t_1 + w_2 t_2 + \cdots + w_k t_k)$, where the $w$'s are any set of
numbers, the procedure is essentially the same. The estimated variance of 
$L$ is $s^2 f$, where $f$ is a numerical factor found by the rules given in section 
3.5. For a test of the null hypothesis that the true value of $L$ is zero, we 
use the ratio $L/(s\sqrt{f})$, which is distributed as Student's t with $n_e$ degrees 
of freedom. 
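
To make the ratio concrete, the sketch below compares the single dose of CN with the control. It borrows the sums of squares quoted in section 3.26 (total 991,565; treatments 157,448; blocks 289,427) and obtains the error sum of squares by subtraction; the function name `t_ratio` is hypothetical.

```python
from math import sqrt

# Degrees of freedom: observations minus independent parameters estimated.
n_obs = 48
n_params = 1 + 8 + 3          # m, the t's, the b's
n_e = n_obs - n_params

# Error sum of squares by subtraction, using figures from section 3.26.
total_ss, treatments_ss, blocks_ss = 991565, 157448, 289427
error_ss = total_ss - treatments_ss - blocks_ss
s2 = error_ss / n_e           # estimated error variance


def t_ratio(mean_i, mean_u, r_i, r_u, s2):
    """(t_i - t_u) / (s * sqrt(1/r_i + 1/r_u)); the difference of estimated
    effects equals the difference of treatment means, since m cancels."""
    return (mean_i - mean_u) / sqrt(s2 * (1 / r_i + 1 / r_u))


# Single dose of CN (total 1066 over 4 plots) against control (5858 over 16).
t_stat = t_ratio(1066 / 4, 5858 / 16, 4, 16, s2)
```

The value of `t_stat` would then be referred to a table of Student's t with `n_e` degrees of freedom.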

ii. The construction of confidence limits. The theory for a test of
significance also leads to confidence limits for the unknown true difference 
$(\tau_i - \tau_u)$. For, when $\tau_i$ and $\tau_u$ are not necessarily equal, the quantity 

$$\frac{(t_i - t_u) - (\tau_i - \tau_u)}{s \sqrt{\dfrac{1}{r_i} + \dfrac{1}{r_u}}}$$

may be shown to follow Student's t-distribution with $n_e$ degrees of
freedom. Hence the confidence limits are given by 

$$(\tau_i - \tau_u) = (t_i - t_u) \pm s \, t(n_e, \alpha) \sqrt{\frac{1}{r_i} + \frac{1}{r_u}}$$

where $t(n_e, \alpha)$ is the value of Student's t corresponding to $n_e$ degrees of 
freedom and the chosen confidence probability $\alpha$. A similar method is 
used for the more general function $L$. 
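
A minimal sketch of the confidence limits follows, again for the single dose of CN against the control. The error variance is the one obtained from the sums of squares in section 3.26, and the tabulated value 2.028 (two-sided 5 per cent point of Student's t for 36 d.f.) is an assumption supplied by hand rather than something computed here.

```python
from math import sqrt


def confidence_limits(diff, s2, r_i, r_u, t_value):
    """Limits (t_i - t_u) -/+ s * t(n_e, alpha) * sqrt(1/r_i + 1/r_u)."""
    half_width = t_value * sqrt(s2 * (1 / r_i + 1 / r_u))
    return diff - half_width, diff + half_width


s2 = 544690 / 36      # error variance on 36 d.f. (error SS by subtraction)
t_tab = 2.028         # assumed tabulated t for 36 d.f., 5 per cent level

diff = 1066 / 4 - 5858 / 16          # estimated difference, 1CN minus control
lower, upper = confidence_limits(diff, s2, 4, 16, t_tab)
```

Because the interval from `lower` to `upper` includes zero, these limits agree with a t-ratio that falls short of the 5 per cent level for this particular comparison.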

iii. A test of the identity of a group of treatment effects. Quite frequently 
the first test to be made is that of the hypothesis that all $k$ treatments 
have produced identical effects, i.e., that all $\tau_i$ are equal. In other cases 
we are more interested in the hypothesis that some subgroup of the
treatments, say treatments 1 to $p$, has produced the same effects. We
consider the latter test, since it reduces to the first test if $p$ is put equal to $k$. 

This test is provided by a general theorem that has many applications. 
First, we find the least squares estimates of all treatment and
environmental effects in the usual way, and compute the residual sum of squares 

$$S_1^2 = \sum (y_{ijk} - m - t_i - b_j)^2$$

Next we start again and rewrite the mathematical model, inserting the 
restriction that $\tau_1 = \tau_2 = \cdots = \tau_p$. That is, wherever any of these 
$\tau_i$ appears in the equation for an observation, we replace the $\tau_i$ by a 
common symbol, say $\tau'$. The least squares estimates are computed for 
this new set of equations, using the same observations. Again we
compute the residual sum of squares, which may be denoted by 

$$S_2^2 = \sum (y_{ijk} - m' - t_i' - b_j')^2$$

This will always be found to be at least as large as $S_1^2$. Finally, the 
theorem states that, if the null hypothesis is true, the quantity 

$$\frac{(S_2^2 - S_1^2)/(p - 1)}{S_1^2 / n_e}$$

follows Snedecor's F-distribution with $(p - 1)$ and $n_e$ degrees of freedom. 
Although the procedure may appear rather complex, the test criterion 
used is a reasonable one. $S_1^2$ is the sum of squares of deviations of the 
observations from the values predicted for them by the original model, 
so that it measures how closely the original model agrees with the data. 
$S_2^2$ plays the same role for the restricted model. Consequently, if $S_2^2$ is 
much larger than $S_1^2$, we are inclined to think that the restricted model 
does not fit the data nearly so well as the original model, and therefore 
to reject the null hypothesis that the restricted model is the correct one. 
However, the value of $(S_2^2 - S_1^2)$ alone does not provide a measure of the 
improvement in the fit with the original model. If, for instance, this
difference is 10, then, other things being equal, we should regard the
improvement in fit as greater when $S_1^2$ is 1 than when $S_1^2$ is 1000. It is this 
type of consideration that leads to the use of the ratio $(S_2^2 - S_1^2)/S_1^2$ 
as the essential part of the test criterion. 

Although the theorem postulates two separate sets of least squares 
solutions, nearly all designs are constructed so that only one set of solu- 
tions must be found in practice. A number of designs which could have 
been included in this book were omitted because the least squares 
analysis seems too cumbersome for frequent use. 
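
The two-fit procedure of the theorem can be followed on a very small balanced example. The yields below are made up for illustration (3 treatments in 4 blocks); in a balanced layout of this kind the least squares fitted values reduce to simple means, which is the shortcut the sketch relies on, and here the whole set of treatments is tested, so $p = k$.

```python
# Made-up yields: rows are treatments, columns are blocks (balanced layout).
y = [[10, 12, 11, 13],
     [14, 15, 13, 16],
     [9, 11, 10, 12]]
k, b = len(y), len(y[0])
cells = [(i, j) for i in range(k) for j in range(b)]

grand = sum(y[i][j] for i, j in cells) / (k * b)
treat_mean = [sum(row) / b for row in y]
block_mean = [sum(y[i][j] for i in range(k)) / k for j in range(b)]

# Original model: fitted value = treatment mean + block mean - grand mean.
S1 = sum((y[i][j] - treat_mean[i] - block_mean[j] + grand) ** 2 for i, j in cells)

# Restricted model (all treatment effects equal): fitted value = block mean.
S2 = sum((y[i][j] - block_mean[j]) ** 2 for i, j in cells)

p = k                                   # all k treatments are being compared
n_e = k * b - (1 + (k - 1) + (b - 1))   # observations minus independent parameters
F = ((S2 - S1) / (p - 1)) / (S1 / n_e)
```

`S2` is never smaller than `S1`, and `F` is referred to the F-distribution with `p - 1` and `n_e` degrees of freedom.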

3.25 The Analysis of Variance. All the procedures (t-tests, F-tests, 
and construction of confidence limits) use the residual sum of squares, 
which will often be called the error sum of squares. This quantity could 
be found by calculating for each observation $y_{ijk}$ the value $(m + t_i + b_j)$ 
predicted by the least squares solution. The sum of the squares of the 
differences between observed and predicted values could then be ob- 
tained. This method is slow, and the error sum of squares is much more 
quickly computed by a technique known as the analysis of variance. 

In the original model, each observation is represented as the sum of 
four components due respectively to the general mean, the effect of the 
treatment, the environmental effect, and the residual effect. In the same 
way the analysis of variance partitions the sum of squares of the
observations into four sums of squares, one attributable to the general 
mean, one to differences between the estimated effects of the treatments, 
one to the environmental effects which the experiment is capable of 
measuring, and lastly one which is the residual or error sum of squares. 
In most cases we compute the original sum of squares and the first three 
components, obtaining the error sum of squares by subtraction. 

The analysis of variance provides much more than a short-cut method 
of securing the error sum of squares. The sum of squares due to treatments 
is the quantity $(S_2^2 - S_1^2)$ needed for the F-test of the hypothesis 
that no differences exist between the effects of the treatments. By a 
slight extension the analysis also supplies the sum of squares required 
for testing the equality of the effects of a subgroup of the treatments. 
The component due to environmental effects enables us to estimate by 
how much the accuracy of the experiment has been increased by eliminat- 
ing these effects from the estimates of the treatment means. 

The analysis of variance depends on a number of algebraic relations 
which will be illustrated for the eelworm experiment. From section 3.23 
it will be recalled that the least squares normal equation for any
unknown, say $t_i$, was 

$$\sum (y_{ijk} - m - t_i - b_j) = 0$$

summed over all observations which received the $i$th treatment. If we 
multiply this equation by $t_i$ and add the equations for different
treatments together, we obtain the equation 

$$\sum t_i (y_{ijk} - m - t_i - b_j) = 0 \qquad (3.10)$$

where the sum is now over all observations, since every observation is 
associated with one and only one $t_i$. Similarly we may establish the
relations 

$$\sum m (y_{ijk} - m - t_i - b_j) = 0 \qquad (3.11)$$

$$\sum b_j (y_{ijk} - m - t_i - b_j) = 0 \qquad (3.12)$$

where both sums extend over all observations. 

A few more relations of this type are required. If we add the estimates 
$t_i$ over all plots in the $j$th block, the sum is 

$$(t_1 + t_2 + t_3 + \cdots + t_8) + 4t_9$$

But this is zero because of equation (3.6) in which a linear relation among 
the $t$'s was introduced. Hence, if we multiply by $b_j$ and add over all 
blocks, we establish the relation 

$$\sum b_j t_i = 0 \qquad (3.13)$$

where the sum is again over all observations. By similar arguments we 
prove the additional relations 

$$\sum m b_j = 0; \qquad \sum m t_i = 0 \qquad (3.14)$$

These relations lead to the partition of the sum of squares of the
observations. Write 

$$y_{ijk} = m + t_i + b_j + (y_{ijk} - m - t_i - b_j)$$

Square both sides, and add over all observations. The six relations 
(3.10) to (3.14) show that all six sums arising from cross-product terms 
on the right-hand side add to zero. Consequently, we have the following 
analysis of sums of squares. 

$$\sum y_{ijk}^2 = \sum m^2 + \sum t_i^2 + \sum b_j^2 + \sum (y_{ijk} - m - t_i - b_j)^2 \qquad (3.15)$$

This equation is the basis of the analysis of variance of the results. 
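
The partition (3.15) and the vanishing cross-products can be checked numerically on a small balanced layout. The yields are made-up numbers (3 treatments in 4 blocks, one plot each), not the eelworm data, and every sum is taken over all observations as in the text.

```python
# Made-up yields: rows are treatments, columns are blocks.
y = [[10, 12, 11, 13],
     [14, 15, 13, 16],
     [9, 11, 10, 12]]
k, b = len(y), len(y[0])
cells = [(i, j) for i in range(k) for j in range(b)]

# Least squares estimates for a balanced layout.
m = sum(y[i][j] for i, j in cells) / (k * b)
t = [sum(y[i]) / b - m for i in range(k)]
bl = [sum(y[i][j] for i in range(k)) / k - m for j in range(b)]
resid = {(i, j): y[i][j] - m - t[i] - bl[j] for i, j in cells}

# Left-hand side of (3.15) and its four components.
lhs = sum(y[i][j] ** 2 for i, j in cells)
parts = {
    "mean": sum(m ** 2 for _ in cells),
    "treatments": sum(t[i] ** 2 for i, j in cells),
    "blocks": sum(bl[j] ** 2 for i, j in cells),
    "error": sum(r ** 2 for r in resid.values()),
}
rhs = sum(parts.values())

# Two of the cross-product sums, in the manner of (3.10) and (3.13).
cross_t_resid = sum(t[i] * resid[i, j] for i, j in cells)
cross_b_t = sum(bl[j] * t[i] for i, j in cells)
```

The quantities `lhs` and `rhs` agree, and both cross-product sums are zero apart from rounding error, which is what allows the sum of squares to be split into the four components.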


3.26 Application to the Example. To obtain the left-hand side of 
(3.15) in practice, we simply compute the sum of the squares of all ob- 
servations. For the eelworm data this will be found to be 5,481,198. 
The components due to the mean, to treatments, and to blocks are not 
usually calculated from (3.15) as it stands, because it is quicker and more 
accurate to obtain them from totals rather than from the estimated ef- 
fects. 

Thus, for the mean, the contribution is $48m^2$, since there are 48
observations. If $G$ is the grand total of all observations, 14,680, this
contribution may be written $G^2/48$, or $(14{,}680)^2/48$, which amounts to 
4,489,633. This term is sometimes called the correction for the mean. 
Written in full, the sum of squares for treatments is 

$$4(t_1^2 + t_2^2 + \cdots + t_8^2) + 16t_9^2 \qquad (3.16)$$

But from (3.8), 

$$t_i = \frac{T_i}{4} - m \quad (i = 1, 2, \ldots, 8); \qquad t_9 = \frac{T_9}{16} - m$$

where $T_i$ is the observed treatment total. By substitution, (3.16)
becomes 

$$\tfrac{1}{4}\left[(T_1 - 4m)^2 + (T_2 - 4m)^2 + \cdots + (T_8 - 4m)^2\right] + \tfrac{1}{16}(T_9 - 16m)^2$$

When each parenthesis is expanded, we have 

$$\frac{T_1^2 + T_2^2 + \cdots + T_8^2}{4} + \frac{T_9^2}{16} - 2m(T_1 + T_2 + \cdots + T_8 + T_9) + 48m^2$$

Now the sum of the observed treatment totals $T_i$ is the grand total $G$, 
or $48m$. Consequently the last two terms above may be amalgamated to 
give 

$$\frac{T_1^2 + T_2^2 + \cdots + T_8^2}{4} + \frac{T_9^2}{16} - 48m^2 \qquad (3.17)$$

Note that the last term is the correction for the mean, already found to 
be 4,489,633. Expression (3.17) is an example of the more general
expression 

$$\sum_{i=1}^{k} \frac{T_i^2}{r_i} - C$$

which gives the treatments sum of squares when the $i$th treatment is 
replicated $r_i$ times, $C$ being the correction for the mean. 

To obtain the numerical value we first calculate the treatment totals 
as shown in table 3.2. 


TABLE 3.2 Treatment totals from table 3.1

Level of
application      CN      CS      CM      CK    Totals
     0                      5858                 5858
     1         1066     928    1431     892      4317
     2         1265     877    1241    1122      4505

The treatments sum of squares is given by

$$\frac{(1066)^2 + (1265)^2 + \cdots + (1122)^2}{4} + \frac{(5858)^2}{16} - 4{,}489{,}633 = 157{,}448$$
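The arithmetic above is easy to reproduce. The following is a minimal sketch that applies the general expression (sum of T_i^2/r_i minus C) to the totals of table 3.2; the variable names are illustrative and not part of the original text.

```python
# Treatment totals from table 3.2, each paired with its number of replicates.
single_level = {"CN": 1066, "CS": 928, "CM": 1431, "CK": 892}
double_level = {"CN": 1265, "CS": 877, "CM": 1241, "CK": 1122}
control_total = 5858

treatments = [(T, 4) for T in single_level.values()]
treatments += [(T, 4) for T in double_level.values()]
treatments.append((control_total, 16))   # control: 4 plots in each of 4 blocks

G = sum(T for T, _ in treatments)        # grand total
N = sum(r for _, r in treatments)        # number of observations
C = G**2 / N                             # correction for the mean
ss_treat = sum(T**2 / r for T, r in treatments) - C

print(G, N, round(C), round(ss_treat))
```

With the totals as tabulated, the grand total is 14,680 over 48 plots, and the rounded results match the figures 4,489,633 and 157,448 quoted in the text.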


The blocks sum of squares, $\sum b_j^2$, can likewise be expressed in terms
of block totals as follows.

$$\frac{B_1^2 + B_2^2 + B_3^2 + B_4^2}{12} - C = \frac{(4383)^2 + (3075)^2 + (4752)^2 + (2470)^2}{12} - 4{,}489{,}633 = 289{,}427$$


where the observed block totals have been inserted from table 3.1. The 
divisor 12 is the number of observations per block. 

The complete analysis of variance is shown in table 3.3. It is not
customary to display the correction term for the mean, since this is of no
particular interest. The item labelled “total” at the foot of the table is
the original sum of squares, minus the correction for the mean (5,481,198
- 4,489,633 = 991,565). This may be shown to be equal to the sum of
squares of deviations of the observations from their mean; i.e.,

$$\sum y_{ijk}^2 - 48m^2 = \sum (y_{ijk} - m)^2$$


TABLE 3.3 Analysis of variance of the data in table 3.1

                          Degrees of     Sum of       Mean
                            freedom      squares     square
Source of variation         (d.f.)       (s.s.)      (m.s.)
Treatments                     8         157,448     19,681
Blocks                         3         289,427     96,476
Error (by subtraction)        36         544,690     15,130
Total                         47         991,565



The column which has been computed above is, of course, the column 
labelled “sum of squares.” The other two columns remain to be ex- 
plained. The “degrees of freedom” associated with any component are 
the number of independent parameters required to describe that com- 
ponent in the model. In the case of treatments, this always equals one 
less than the number of treatments, and similarly for blocks. For the 
total s.s., the number is the number of observations (48) less one repre- 
senting the contribution of the mean. 

The degrees of freedom have two principal uses. First, by subtraction,
they give the degrees of freedom for error (36). As mentioned in section
3.24, this is the divisor needed for the error s.s. in order to estimate the
error variance $\sigma^2$. The estimate is 544,690/36, or 15,130, which is called
the error mean square and is shown in the right-hand column of table 3.3.
Second, the degrees of freedom for treatments are used in an F-test of
the hypothesis that all $k$ treatments produced the same effects. In
section 3.24 the value of F for testing this hypothesis was given as

$$F = \frac{S_2^2 - S_1^2}{k - 1} \div \frac{S_1^2}{n_e}$$

The denominator $S_1^2/n_e$ is the error m.s. In the numerator the
quantity $(S_2^2 - S_1^2)$ may be shown to be the treatments s.s., while $(k - 1)$
is the number of degrees of freedom associated with treatments. Thus
the numerator is 157,448/8, or 19,681, called the treatments mean square.
The F-ratio for this experiment is 19,681/15,130, or 1.30, with 8 and 36
d.f.
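The construction of table 3.3 can be sketched in a few lines. The figures below are those quoted in the text (uncorrected sum of squares, grand total, block totals, and the treatments s.s.); the variable names are assumptions made for this illustration.

```python
# Figures quoted in the text for the eelworm experiment.
n_obs = 48
n_treatments = 9
grand_total = 14_680
uncorrected_ss = 5_481_198
ss_treat = 157_448                       # treatments s.s. as quoted above
block_totals = [4383, 3075, 4752, 2470]
plots_per_block = 12

C = grand_total**2 / n_obs               # correction for the mean
ss_total = uncorrected_ss - C
ss_blocks = sum(B**2 for B in block_totals) / plots_per_block - C
ss_error = ss_total - ss_treat - ss_blocks   # error s.s. by subtraction

df_total = n_obs - 1
df_treat = n_treatments - 1
df_blocks = len(block_totals) - 1
df_error = df_total - df_treat - df_blocks

ms_treat = ss_treat / df_treat
ms_error = ss_error / df_error
f_ratio = ms_treat / ms_error

print(df_error, round(ss_error), round(ms_error), round(f_ratio, 2))
```

The error line is obtained purely by subtraction, as in the table, so any slip in the treatments or blocks s.s. would pass straight into it; this is the reason for the checks discussed in section 3.32.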

In the same way the mean square for blocks forms the numerator for
the F-test of the hypothesis that all block effects are identical. Although
this test is rather seldom of interest, the mean square for blocks has other
uses. By formulae which will be given with the individual designs, this
mean square leads to an estimate of the error variance that would have
been obtained if the experiment had not been grouped into blocks.

With some designs the structure of the analysis of variance is more 
complicated than in the example above. What happens is that the 
treatment and environmental contributions to the sum of squares are 
entangled. If we calculate the treatments s.s., a different result is ob- 
tained according to whether we assume that block effects are present or 
absent, and vice versa for blocks. In such cases the usual procedure is 
to present the blocks s.s. as calculated when treatment effects are
ignored, and the treatments s.s. as calculated when block effects are taken
into account. These items are sometimes called “blocks, ignoring treat- 
ments” and “treatments, eliminating blocks,” respectively. If calcula- 
tions are made in this way, the two terms may still be added so as to 
obtain the error s.s. by subtraction; further, the treatments s.s. is still 
the appropriate one for an F-test of the treatment effects. 

This presentation of the analysis of variance is necessarily very in- 
adequate. For further reading, reference should be made to Fisher 
(3.2, 3.3) and Snedecor (3.4), whose discussions involve little
mathematics. For presentation of points of theory, see Kendall’s book (3.5)
and papers by Fisher (3.6, 3.7) and Yates (3.8). Unfortunately, no single 
reference contains a really comprehensive account of the subject. 

3.3 Accuracy in Computations 

3.31 Original Records. If there exists a series of rules which will 
guarantee that all computations are accurately done, we do not know 
them. The following notes may help in maintaining a high standard of 
accuracy. 

The first place where errors may occur is in the original records. These 
should be made in a clearly legible and permanent form. Where scale 
readings are being taken, the habit of checking each reading immediately 
after it has been written down is a useful one. As the observations are 
being recorded, any that appear anomalous should be examined and the 
reason sought. Gross errors are often eliminated in this way. If on 
checking such an observation no error or explanation is found, a note to
this effect should be made. Such notes are helpful in situations where 
the later analysis is done by a different person. Most statisticians who 
analyze other people’s experiments have encountered the perplexing 
problem where an observation is so completely out of line that a gross 
error is suspected, yet the original records contain no comment about
the observation, and the recorder’s memory of the circumstances in
which it was taken has faded. 

Copying from the original records to a form suitable for analysis may 
also introduce errors. Since cross-checking is rather a soporific task, mis- 
takes tend to be overlooked even when two persons are used for the 
check. Sometimes foresight can eliminate the need for copying. 

3.32 Checks in the Analysis of Variance. The analysis of variance it- 
self consists of two sorts of operations. First, a number of totals are 
formed. These are nearly all self-checking, for the fact that the treat- 
ment totals and the block totals both add to the grand total may be ac- 
cepted as proof of their correctness. This is not a watertight check, since 
compensating errors in two treatment totals pass undetected. Also it 
does not ensure that treatments and blocks are correctly identified, and 
cases have occurred where treatment totals were mistakenly labelled as 
block totals, and vice versa. But apart from such rare events the sum 
check is as satisfactory as any. 

The second stage, the computation of the various sums of squares, is 
not self-checking and must be checked by recomputation. The most ac- 
curate method is to have two computers carry out the analysis inde- 
pendently, comparing results only when both have finished the task. 
The practice of having one computer “check” the results of the other, 
though more convenient, always seems to produce a certain number of 
errors. This is to be expected, because, when the first calculations are 
nearly always correct, as they should be, it is difficult for the checker to 
avoid the presupposition that they are correct, with resulting mental 
laziness. Recomputation by the same person, though often unavoidable, 
is probably less accurate than the two preceding methods. 

An expert who wishes to do so can make the analysis self-checking. 
For instance, the error s.s. can be computed independently of the other 
sums of squares, sometimes quite easily and sometimes with more diffi- 
culty. If this sum of squares agrees with that found by subtraction, all 
the items in the “sums of squares” column may be regarded as correct, 
apart possibly from errors in labelling. However, with the computa- 
tional methods that we present, checking is required except in a few cases 
specifically noted. 

The steps in the preparation of summary tables (e.g., calculation of 
treatment means from totals) are usually partly but not entirely self- 
checking. For example, the fact that treatment means average to the 
general mean forms a check. Care should be taken to recompute those 
steps that are not self-checked.

With the more complex designs, where treatment means have to be
adjusted in order to eliminate the environmental effects, errors have 
arisen because the computing instructions were misunderstood. For 
instance, the adjustments made were ten times as large as they should 
have been. A person thoroughly familiar with the design would have 
known that the adjustments were ridiculously large, but someone new 
to the design may have no idea what size of adjustment it is reasonable 
to expect. This danger with complex designs should not be ignored. It 
can be minimized by careful study of the method of analysis for any 
design that it is proposed to use. 

3.33 Number of Figures to be Retained. Since the primary purpose
of the analysis of variance is to obtain an accurate value for the error 
m.s., the number of figures which it is worth retaining during the cal- 
culations depends on the size of this figure. This is not known in ad- 
vance, but often a rough idea of the coefficient of variation (ratio of the 
standard error to the mean of the experiment) is available from previous 
experience. For the original data from which the analysis of variance is 
computed, a crude rule which errs on the safe side is as follows. Record 
the original data to 4 significant figures if the coefficient of variation is 
between 0.4 and 4%, to 3 if it is between 4 and 40%, and to 2 if it exceeds 
40%. Let us apply this rule to the eelworm data. Since experiments on
eelworms are not very common, there is not much basis on which to
predict the coefficient of variation. Suppose that we guess that it is un- 
likely to be below 20% (it actually turned out to be 40%). Then, for 
safety, data should be recorded to 3 significant figures. Since the average 
eelworm number per plot is around 300, this means that data are re- 
corded to the nearest eelworm, which is as accurately as they could be 
recorded. 

A more precise rule, requiring a little calculation, is that the rounding 
interval should not exceed one-quarter of the standard error per observa- 
tion. To apply this rule to the eelworm data, starting with the same 
premise that the coefficient of variation is 20%, we note that the mean 
per plot in the experiment is about 300. Thus the predicted value of the 
standard error per observation is about 60, so that the rounding interval 
should not exceed 15. It will be quite satisfactory to round each ob- 
servation to the nearest 10 eelworms. The discrepancy from the result 
given by the previous rule illustrates the fact that the first rule tends to 
be too conservative. 
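The two rules can be written down compactly. The sketch below is one possible reading of them: the function names are hypothetical, and the treatment of a coefficient of variation of exactly 4% or 40% is an assumption, since the text does not say which side of the boundary such values fall on.

```python
def significant_figures(cv_percent):
    """Crude rule: figures to record, given a guessed coefficient of variation (%)."""
    if cv_percent > 40:
        return 2
    if cv_percent >= 4:
        return 3
    if cv_percent >= 0.4:
        return 4
    raise ValueError("the rule in the text does not cover a CV below 0.4%")


def max_rounding_interval(cv_percent, mean):
    """More precise rule: rounding interval at most 1/4 of the s.e. per observation."""
    std_error = cv_percent * mean / 100
    return std_error / 4


# Eelworm example: guessed CV of 20%, mean of about 300 per plot.
print(significant_figures(20))           # 3 significant figures
print(max_rounding_interval(20, 300))    # 15.0, so rounding to the nearest 10 is safe
```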

The advisability of deliberately rounding records already taken should 
be left to the judgment of the experimenter. With modern calculating
machines it may be more expeditious to carry out the computations with
a few unnecessary figures than to take time to perform and check the
rounding. The rules are most likely to be useful where computing de- 
vices are poor or unavailable. 

In the analysis of variance itself, it is generally as well to carry the full 
number of figures obtained from the uncorrected sum of squares; e.g., if 
the original data contain one decimal place, the sum of squares and hence 
the analysis of variance will contain two decimal places. This number 
will usually be excessive, but no more time is likely to be wasted in 
writing down the unnecessary digits than in estimating how many digits 
should be retained. 

In the final presentation of results, on the other hand, superfluous 
digits should be strictly avoided. They make the conclusions more 
difficult to grasp, impede rapid mental comparisons, and give some 
readers an erroneous impression of the accuracy of the results. A treat- 
ment mean should be rounded to one-tenth of its estimated standard 
error; that is, if the estimated standard error of the treatment means is 
2.50, the means could be rounded to 0.25 and should be rounded to 0.1,
i.e., to one decimal place, since this is the nearest rounding interval that
is convenient. In some experiments (e.g., those of the split-plot type) 
different treatment comparisons have different standard errors. Here 
the rounding interval should be decided from those comparisons that 
have the smallest error. 
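As a rough sketch of this rule, the function below picks the largest power of ten that does not exceed one-tenth of the standard error. Restricting the "convenient" intervals to powers of ten is an assumption made here, and the function name is hypothetical.

```python
import math


def rounding_digits(std_error):
    """Number of decimal places such that the rounding interval (a power of
    ten) does not exceed one-tenth of the estimated standard error."""
    exponent = math.floor(math.log10(std_error / 10))
    return -exponent


digits = rounding_digits(2.50)      # one-tenth of 2.50 is 0.25, so interval 0.1
print(digits, round(37.4632, digits))
```

A negative result means rounding to tens, hundreds, and so on, which Python's built-in round accepts as a negative ndigits argument.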

3.34 Identification of Data. The practice should be followed of label- 
ling all data, from original records to summary tables, so that another 
person at a later time can tell what the data mean. Much data, ex- 
pensive to collect and potentially valuable for some research, have had 
to be discarded or destroyed because the original collector cannot be 
reached, and new investigators are unable to discover what measure- 
ments were taken and under what circumstances. Sometimes the units 
in which the observations or summary tables are presented cannot be 
found, or the crop on which a field experiment was carried out, or the 
nature of the experimental treatments. Frequently, data serve their 
most fruitful purpose when some later investigator gathers from many 
places all the material bearing on some question, and it should be a habit 
to facilitate rather than hinder such research. 


3.4 Subdivision of the Sum of Squares for Treatments 

3.41 Reasons for Subdivision. Often an experiment is planned to 
provide the answers to a number of different questions, not necessarily 
connected with one another. The eelworm experiment is a partial
though not an ideal example. One way in which the questions asked
might be framed is as follows. (i) Is soil fumigation effective? Here we
are interested in the average reduction in eelworm numbers taken over
all fumigants. (ii) Are there differences in the effectiveness of different
fumigants? For this question, comparisons of the reductions due to
different fumigants are relevant. (iii) What is the relative effectiveness of
single and double doses? Since the simplest reasonable hypothesis is 
that the reduction in eelworm numbers to the double dose is twice that 
to the single dose, we might narrow this question by asking: Is the aver- 
age response to fumigation proportional to the amount of dressing, or in 
other words is the average response curve linear? (iv) Finally, if there is 
some indication of curvature in the responses, we might ask whether the 
amount of curvature is the same for all fumigants. 

With experiments of this type the F-test of the complete treatments 
m.s. is not particularly helpful because it is directed, as it were, at a 
mixture of several diverse questions. By an extension of the analysis of 
variance, we can subdivide the treatments s.s. into a number of com- 
ponents that are more relevant to the individual questions. Moreover, 
an F-test can be made on the mean square for each component. The 
rules for subdivision are given in succeeding sections. Effective use of 
the device requires long practice applied to a considerable number of ex- 
periments. 

3.42 Subdivision into Single Components. The simplest case, where 
all treatments have the same number r of replications, is considered first. 
The calculations are best made from the treatment totals $T_i$. Examples
of the sort of quantity that we are interested in studying are

$$T_1 - T_2; \qquad \frac{T_1 + T_2}{2} - T_3; \qquad T_1 + T_2 - (T_3 + T_4)$$

Quantities of this type are called linear functions of the $T$'s. Note that
the sum of the coefficients of the $T$'s is always zero, as it must be if the
quantity is to represent a comparison among the $T$'s. These ideas may
be formalized as follows.

Definition. Any linear function

$$z_w = l_{w1}T_1 + l_{w2}T_2 + \cdots + l_{wk}T_k$$

is called a comparison among the $T$'s if

$$l_{w1} + l_{w2} + \cdots + l_{wk} = 0$$

Rule 1. If $z_w$ is any comparison among the $T$'s, the quantity

$$\frac{z_w^2}{D_w}, \quad \text{where } D_w = r(l_{w1}^2 + l_{w2}^2 + \cdots + l_{wk}^2)$$

is a component of the sum of squares for treatments and represents 1 d.f.

This rule gives the divisor $D_w$ to be used so that the comparison $z_w$
may be included as a part of the treatments s.s. in the analysis of
variance. Incidentally, the rule implies that $z_w^2/D_w$ cannot exceed the
treatments s.s. If it is found to do so, this is a sure sign of a mistake in
computation.

Definition. Two comparisons $z_1$ and $z_2$ are said to be orthogonal if

$$l_{11}l_{21} + l_{12}l_{22} + \cdots + l_{1k}l_{2k} = 0$$

This “sum of products” relation is very important. By theory, it
ensures that $z_1$ and $z_2$ are distributed independently of each other. Its
relevance to the partition of the treatments s.s. appears in rule 2.

Rule 2. If $z_1$ and $z_2$ are orthogonal, then $z_2^2/D_2$ is a component of

$$\text{Treatments s.s.} - \frac{z_1^2}{D_1}$$

This means that, if we divide the treatments s.s. into the contribution
from a comparison $z_1$ and the remainder, and now wish to subdivide the
remainder, we must choose comparisons that are orthogonal to $z_1$.
Similarly, after removing the contribution of $z_2$, the next comparison must
be orthogonal to both $z_1$ and $z_2$, and so on. This leads to the next rule.

Rule 3. If the comparisons $z_1, z_2, \ldots, z_{k-1}$ are mutually orthogonal,
i.e., every pair is orthogonal, then

$$\text{Treatments s.s.} = \frac{z_1^2}{D_1} + \frac{z_2^2}{D_2} + \cdots + \frac{z_{k-1}^2}{D_{k-1}}$$

This algebraic identity partitions the treatments s.s. [which has $(k - 1)$
degrees of freedom] into $(k - 1)$ components, each representing a single
degree of freedom.

Starting with a specified $z_1$, we can always find $z_2, z_3, \ldots, z_{k-1}$ so as to
construct a complete orthogonal set. In fact, it may be proved that there
is considerable freedom of choice in this process. When all but the final
$z_{k-1}$ have been selected, there is only one possible choice for $z_{k-1}$, but at
any previous stage many $z$'s can be found that are orthogonal to all
preceding $z$'s. The experimenter must choose the $z$'s that are most relevant
for purposes of interpretation.
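Rules 1 to 3 can be checked on a small example. The sketch below uses made-up treatment totals and a made-up number of replicates (both hypothetical), with a familiar set of three mutually orthogonal comparisons for four equally spaced levels; the function names are assumptions introduced for this illustration.

```python
def is_comparison(coeffs):
    """A linear function of the totals is a comparison if its coefficients sum to zero."""
    return sum(coeffs) == 0


def are_orthogonal(c1, c2):
    """Sum-of-products condition for two comparisons (equal replication)."""
    return sum(a * b for a, b in zip(c1, c2)) == 0


def component(coeffs, totals, r):
    """Rule 1: z^2 / D with D = r * (sum of squared coefficients)."""
    z = sum(l * T for l, T in zip(coeffs, totals))
    D = r * sum(l * l for l in coeffs)
    return z * z / D


totals = [40, 52, 61, 55]    # hypothetical treatment totals
r = 5                        # hypothetical number of replicates per treatment
comparisons = [
    [-3, -1, 1, 3],          # linear
    [1, -1, -1, 1],          # quadratic
    [-1, 3, -3, 1],          # cubic
]

ss_treat = sum(T * T for T in totals) / r - sum(totals) ** 2 / (r * len(totals))
parts = [component(c, totals, r) for c in comparisons]

print(ss_treat, sum(parts))  # rule 3: the components add to the treatments s.s.
```

Because the three comparisons form a complete orthogonal set for k = 4 treatments, the single-degree-of-freedom components should add up to the treatments s.s., apart from rounding.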



Table 3.4 shows the single components most frequently used for
interpretative purposes when there are three or four treatments. It is
assumed that all treatments have the same number of replicates. The
divisors D are those required for inserting the square of z in the analysis
of variance. The reader should verify that these divisors satisfy rule 1,
and that each set of z’s forms a complete orthogonal set.


TABLE 3.4 Sample sets of single components

Three treatments

i. Equally spaced increments of one ingredient

                  z1       z2
T1                -1       +1
T2                 0       -2
T3                +1       +1
Component        Lin.     Quad.
Divisor D         2r*      6r

ii. Two qualities of an ingredient, and a control

                  z1             z2
T1 (0)            -2              0
T2 (a1)           +1             -1
T3 (a2)           +1             +1
Component     Effect of a   Quality diff.
Divisor D         6r             2r

Four treatments

i. Equally spaced increments of one ingredient

                  z1       z2       z3
T1                -3       +1       -1
T2                -1       -1       +3
T3                +1       -1       -3
T4                +3       +1       +1
Component        Lin.     Quad.    Cubic
Divisor D        20r       4r      20r

ii. Two comparable types of ingredient, a and b, and two qualities
of each type

                  z1           z2              z3
T1 (a1)           +1           +1               0
T2 (a2)           +1           -1               0
T3 (b1)           -1            0              +1
T4 (b2)           -1            0              -1
Component       a vs. b   Quality diff.   Quality diff.
                            within a        within b
Divisor D         4r           2r              2r

iii. Different levels of each of two different ingredients a and b

                  z1           z2             z3
T1 (a1b1)         -1           -1             +1
T2 (a2b1)         +1           -1             -1
T3 (a1b2)         -1           +1             -1
T4 (a2b2)         +1           +1             +1
Component       Average      Average      Interaction
                response     response
                  to a         to b
Divisor D         4r           4r             4r

* r = number of replicates, assumed the same for all treatments. The
coefficients given above are not valid if r varies from treatment to
treatment.

With three treatments the two sets presented are essentially the same, 
though the meaning attached to each z is different. Case i applies in an 
experiment where we have, say, zero, single, and double dressings of a 
fumigant, and wish to isolate the linear and quadratic components of the 
response curve. The response found by fitting a straight line to the three 
levels is represented by $z_1$, while $z_2$ measures the deviation from a linear
response. Case ii arises when we have comparable dressings of two
different fumigants, and wish to examine the average effect of fumigation
($z_1$) and the difference between the fumigants ($z_2$).

With four treatments, case i shows zero, single, double, and triple ap- 
plications of the same ingredient; here the response curve can be divided 
into its linear, quadratic, and cubic components. Alternatively, in case 
ii, we might be testing the wearing qualities of two types of “100% wool” 
suits and two types of suits made of wool-rayon mixtures. Case iii is 
used, for instance, when we examine the effects of two different amounts 
of sugar and two different amounts of vanilla in the preparation of a cake. 
Experiments of this type are discussed more fully in chapter 5. 

When different treatments have differing numbers of replicates, the
rules are changed slightly. If the $i$th treatment has $r_i$ replicates, we have
shown that the treatments s.s. is

$$\frac{T_1^2}{r_1} + \frac{T_2^2}{r_2} + \cdots + \frac{T_k^2}{r_k} - \frac{(T_1 + T_2 + \cdots + T_k)^2}{\sum r_i}$$

The changes to be noted are:

i. A linear function

$$z_w = l_{w1}T_1 + l_{w2}T_2 + \cdots + l_{wk}T_k$$

is a comparison among the treatment totals $T_i$ if

$$r_1l_{w1} + r_2l_{w2} + \cdots + r_kl_{wk} = 0$$

ii. The divisor required for $z_w^2$ is

$$D_w = r_1l_{w1}^2 + r_2l_{w2}^2 + \cdots + r_kl_{wk}^2$$

iii. Two comparisons $z_1$ and $z_2$ are orthogonal if

$$r_1l_{11}l_{21} + r_2l_{12}l_{22} + \cdots + r_kl_{1k}l_{2k} = 0$$
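The modified conditions can be illustrated with the eelworm treatment totals of table 3.2, where the eight fumigant treatments have 4 replicates each and the control has 16. The two sets of coefficients below are chosen here purely as an illustration (double level against control, and a curvature-type comparison), and the function names are hypothetical.

```python
def is_comparison(coeffs, reps):
    """Condition i: the replicate-weighted sum of coefficients is zero."""
    return sum(r * l for r, l in zip(reps, coeffs)) == 0


def divisor(coeffs, reps):
    """Condition ii: D = sum of r_i * l_i^2."""
    return sum(r * l * l for r, l in zip(reps, coeffs))


def are_orthogonal(c1, c2, reps):
    """Condition iii: the replicate-weighted sum of products is zero."""
    return sum(r * a * b for r, a, b in zip(reps, c1, c2)) == 0


# Order: four single-level totals, four double-level totals, control.
totals = [1066, 928, 1431, 892, 1265, 877, 1241, 1122, 5858]
reps = [4] * 8 + [16]

z1_coeffs = [0, 0, 0, 0, 1, 1, 1, 1, -1]        # double level minus control
z2_coeffs = [-2, -2, -2, -2, 1, 1, 1, 1, 1]     # double + control - 2 * single

z1 = sum(l * T for l, T in zip(z1_coeffs, totals))
z2 = sum(l * T for l, T in zip(z2_coeffs, totals))

print(is_comparison(z1_coeffs, reps), divisor(z1_coeffs, reps), z1)
print(is_comparison(z2_coeffs, reps), divisor(z2_coeffs, reps), z2)
print(are_orthogonal(z1_coeffs, z2_coeffs, reps))
```

Note that the plain sum of the coefficients of the first comparison is 3, not zero; it is the replicate-weighted sum that must vanish when replication is unequal.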


3.43 Incomplete Subdivisions. The subdivision need not be
complete in the sense that it is composed entirely of single components. In
the eelworm experiment we suggested that the treatments s.s., with 8
components, might be divided into the following parts: (i) a single
component representing the average response to fumigation; (ii) a single
component representing the deviation of the average response from
linearity; (iii) a part representing differences in the responses to the
different fumigants. Since there are 4 fumigants, this part comprises 3
components; (iv) a part representing differences in the curvature of the
response curves for different fumigants: this also has 3 components.
Some rules for partial subdivision will be given and applied to this
example. The most general rule is as follows.

Rule 4. A set of $g$ quantities $Q_i$ are independent components of the
treatments s.s., with $\nu_i$ degrees of freedom respectively, if

$$Q_i = \frac{z_{i1}^2}{D_{i1}} + \frac{z_{i2}^2}{D_{i2}} + \cdots + \frac{z_{i\nu_i}^2}{D_{i\nu_i}} \qquad (i = 1, 2, \ldots, g)$$

where all $z_{ij}$ are mutually orthogonal comparisons, and the $D_{ij}$ are the
appropriate divisors. Further, the remainder

$$\text{Treatments s.s.} - \sum Q_i$$

is an independent component of the treatments s.s. with
$[(k - 1) - \sum \nu_i]$ degrees of freedom.

This rule, which is a deduction from previous rules, is not very helpful 
to the beginner. With experience it often becomes easy, without carry- 
ing out the details, to see whether a set of Qi can be expressed in this 
way. Some more specialized rules follow. 

Rule 4a. If $T_1, T_2, \ldots, T_p$ are the totals for any set of treatments,
then

$$Q = \frac{T_1^2}{r_1} + \frac{T_2^2}{r_2} + \cdots + \frac{T_p^2}{r_p} - \frac{(T_1 + T_2 + \cdots + T_p)^2}{r_1 + r_2 + \cdots + r_p}$$

is a component of the treatments s.s., with $(p - 1)$ degrees of freedom.

This rule states that the sum of squares of deviations among any
subgroup of the treatments, with proper divisors, is part of the treatments
s.s.

Rule 4b. Sometimes the treatments (or part of them) can be divided
into a number of groups. The number of treatments in a group need not
be constant, and different treatments may have different amounts of
replication. Let $T_{ij}$ be the total for the $j$th treatment in the $i$th group,
and let

$$S_i = T_{i1} + T_{i2} + \cdots + T_{ip_i}; \qquad R_i = r_{i1} + r_{i2} + \cdots + r_{ip_i}$$

Then let

$$Q_i = \frac{T_{i1}^2}{r_{i1}} + \frac{T_{i2}^2}{r_{i2}} + \cdots + \frac{T_{ip_i}^2}{r_{ip_i}} - \frac{S_i^2}{R_i} \qquad (i = 1, 2, \ldots, g)$$

$$Q_{g+1} = \frac{S_1^2}{R_1} + \cdots + \frac{S_g^2}{R_g} - \frac{(S_1 + \cdots + S_g)^2}{R_1 + \cdots + R_g}$$

In this case $Q_i$ represents the sum of squares of deviations among the
treatments in the $i$th group, with $(p_i - 1)$ degrees of freedom, while
$Q_{g+1}$ is the sum of squares of deviations among the group totals with
$(g - 1)$ degrees of freedom. These $Q$’s are independent components of
the treatments s.s.

Rule 4c. This applies when all treatments have the same number $r$ of
replicates, and every group contains $p$ treatments. Let

$$z_i = l_1T_{i1} + l_2T_{i2} + \cdots + l_pT_{ip}$$

where the set of $l$’s remains the same in all groups. Then

$$Q = \frac{\sum (z_i - \bar{z})^2}{r(l_1^2 + l_2^2 + \cdots + l_p^2)}$$

is a component of the treatments s.s., with $(g - 1)$ degrees of freedom.
This rule is useful when we wish to compare linear functions of the
treatments within each group. Note that the divisor in $Q$ is the same as the
divisor that would be used for a single $z$.

Applications. In the eelworm experiment, the average effects of
fumigation are obtained from a comparison of the total for the control with
the totals for the single and double levels of fumigation. Consequently,
we might regard the 9 treatments as divided into 3 groups: the control,
the single levels, and the double levels. By rule 4b, this gives the
following partition of the treatments s.s. (the treatment totals are found in
table 3.2, p. 50).


Between fumigants (single level), 3 d.f.:

$$\frac{(1066)^2 + (928)^2 + (1431)^2 + (892)^2}{4} - \frac{(4317)^2}{16} = 45{,}461$$

Between fumigants (double level), 3 d.f.:

$$\frac{(1265)^2 + (877)^2 + (1241)^2 + (1122)^2}{4} - \frac{(4505)^2}{16} = 23{,}641$$

Between levels, 2 d.f.:

$$\frac{(5858)^2 + (4317)^2 + (4505)^2}{16} - \frac{(14{,}680)^2}{48} = 88{,}347$$
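The partition by rule 4b can be reproduced as follows. This is a minimal sketch: the function name and the way the groups are represented (lists of total and replicate pairs) are assumptions made for the illustration, while the totals are those of table 3.2.

```python
def rule_4b(groups):
    """groups: list of groups, each a list of (treatment total, replicates).
    Returns the within-group sums of squares and the between-groups s.s."""
    within, S, R = [], [], []
    for group in groups:
        s = sum(T for T, _ in group)          # group total S_i
        r = sum(n for _, n in group)          # plots in the group R_i
        within.append(sum(T * T / n for T, n in group) - s * s / r)
        S.append(s)
        R.append(r)
    between = sum(s * s / r for s, r in zip(S, R)) - sum(S) ** 2 / sum(R)
    return within, between


control = [(5858, 16)]
single = [(1066, 4), (928, 4), (1431, 4), (892, 4)]
double = [(1265, 4), (877, 4), (1241, 4), (1122, 4)]

within, between = rule_4b([control, single, double])
print([round(q) for q in within], round(between))
```

The control group contains a single treatment, so its within-group s.s. is zero with no degrees of freedom; the remaining three parts carry 3, 3, and 2 d.f.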


The three sums of squares add to 157,449, in agreement with the
treatments s.s., 157,448, as given in table 3.3, p. 51.

The next step is to divide the sum of squares between levels into a
component representing the average response to fumigation and one
representing the deviation from a linear response. By least squares
theory, the former is given by a comparison of the double level with the
zero level. That is, we may take

$$z_1 = S_2 - S_0 = 4505 - 5858 = -1353$$

where the subscript refers to the level. The deviation from linearity is
measured by the orthogonal comparison

$$z_2 = (S_2 - S_1) - (S_1 - S_0) = S_2 + S_0 - 2S_1 = 4505 + 5858 - 2(4317) = 1729$$

Since each $S$ is a total over 16 plots, the divisors are 32 and 96
respectively. Hence the contributions to the sum of squares are:

Average linear response:

$$\frac{(1353)^2}{32} = 57{,}207$$

Average curvature:

$$\frac{(1729)^2}{96} = 31{,}140$$


The subdivision found thus far might be presented as shown below. 


                                        d.f.      s.s.       m.s.
Between treatments                        8     157,449     19,681
Average linear response                   1      57,207     57,207
Average curvature                         1      31,140     31,140
Between single levels of fumigants        3      45,461     15,154
Between double levels of fumigants        3      23,641      7,880


This is not the subdivision envisaged at the beginning of this section.
Instead, it was proposed to divide the final 6 d.f. into 3 representing
differences in the responses to individual fumigants and 3 representing
differences in the curvatures of the individual response curves. This
separation is more difficult.

Consider a comparison of (CN) and (CS). Estimates of the difference
between these fumigants are available both at the single (1CN) and the
double (2CN) level. If the effects of both fumigants are proportional
to the amounts of dressing, the true difference at the double level will
be twice that at the single level. On this assumption, two independent
estimates of the difference between (CN) and (CS), for a single dressing,
are

$$[(\text{1CN}) - (\text{1CS})] \quad \text{and} \quad \frac{[(\text{2CN}) - (\text{2CS})]}{2}$$

The second estimate is considerably more accurate than the first, since
its variance is only one-quarter as large. Statistical theory shows that such
estimates are combined by weighting each inversely as its variance.

Hence, on the assumption of linearity, the most accurate estimate of
the difference per unit dressing is

$$\frac{[(\text{1CN}) - (\text{1CS})] + (4)(\tfrac{1}{2})[(\text{2CN}) - (\text{2CS})]}{1 + 4} = \frac{[(\text{1CN}) - (\text{1CS})] + 2[(\text{2CN}) - (\text{2CS})]}{5} \qquad (3.18)$$
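The weighting argument can be made concrete with the (CN) and (CS) totals of table 3.2. The sketch below is illustrative only: the function name is hypothetical, the differences are taken on treatment totals rather than means, and the variances are expressed relative to that of the single-level difference.

```python
def combine_inverse_variance(estimates, variances):
    """Weighted mean of independent estimates, each weighted by 1 / variance."""
    weights = [1 / v for v in variances]
    return sum(w * e for w, e in zip(weights, estimates)) / sum(weights)


d_single = 1066 - 928            # (1CN) - (1CS)
d_double = (1265 - 877) / 2      # [(2CN) - (2CS)] / 2

# Relative variances: halving the double-level difference quarters its variance.
combined = combine_inverse_variance([d_single, d_double], [1.0, 0.25])

# The same quantity written in the simplified form of (3.18).
direct = ((1066 - 928) + 2 * (1265 - 877)) / 5

print(combined, direct)
```

Both expressions give the same value, which is the point of the simplification in (3.18).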

Equation (3.18) shows that differences in the linear responses to the 4
fumigants are measured by comparisons of the quantities (1CN) + 2(2CN),
etc. These quantities are given below.

(Single level) + 2(double level)

(CN)     (CS)     (CM)     (CK)     Total
3596     2682     3913     3136     13,327

It is for quantities of this kind that rule 4c is useful. This rule shows that
the sum of squares of deviations of these quantities, when divided by
$r\sum l^2$, or $(4)[(1)^2 + (2)^2]$, i.e., 20, is a component of the treatments s.s.
This gives 43,408, representing differences in linear responses to the
fumigants.
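Rule 4c, as applied here, can be sketched as below. The function name and the layout of the input (one pair of totals per fumigant, single level first) are assumptions made for this illustration; the totals come from table 3.2.

```python
def rule_4c(group_totals, coeffs, r):
    """Apply the same coefficients within every group, then form the sum of
    squares of deviations of the resulting z's, divided by r * sum(l^2)."""
    z = [sum(l * T for l, T in zip(coeffs, totals)) for totals in group_totals]
    z_bar = sum(z) / len(z)
    q = sum((zi - z_bar) ** 2 for zi in z) / (r * sum(l * l for l in coeffs))
    return z, q


# (single-level total, double-level total) for each fumigant.
fumigants = {
    "CN": (1066, 1265),
    "CS": (928, 877),
    "CM": (1431, 1241),
    "CK": (892, 1122),
}

# Coefficients (1, 2): single level + 2 * double level; r = 4 replicates.
z, q_linear = rule_4c(list(fumigants.values()), (1, 2), 4)

print(z, sum(z))
print(q_linear)   # about 43,408.6; the text reports 43,408
```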

We now consider the final three components. The curvature of an
individual response curve is measured by a comparison of the type

$$[(\text{2CN}) - (\text{1CN})] - [(\text{1CN}) - (0)] = [(\text{2CN}) + (0) - 2(\text{1CN})]$$

where (0) represents the total number of eelworms over 4 plots for the
control treatment. When the difference between two fumigants is taken,
the (0) term disappears and we obtain the comparison

$$[(\text{2CN}) - 2(\text{1CN})] - [(\text{2CS}) - 2(\text{1CS})] \qquad (3.19)$$

It is easy to verify that this is orthogonal with comparisons like (3.18).
Consequently differences in curvature are compared by means of the
quantities shown below.

2(Single level) - (double level)

(CN)     (CS)     (CM)     (CK)     Total
 867      979     1621      662     4129

Rule 4c again applies. The divisor is 20, and the sum of squares
contributes 25,693. The final separation is given in table 3.5. It is evident
from F-tests that none of the mean squares is significant at the 5% level,
and only that for the average linear response approaches near to the
5% level. These results must not be taken as final conclusions for this
experiment. As will be seen in section 3.8, additional data were recorded
in this experiment which permit a more accurate analysis.

TABLE 3.5 Subdivision of the treatments s.s. for the eelworm data

Source of variation               d.f.       s.s.       m.s.

Average linear response             1      57,207     57,207
Average curvature                   1      31,140     31,140
Differences in linear response      3      43,408     14,469
Differences in curvature            3      25,693      8,564
Error                              36     544,690     15,130
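
As a check on table 3.5, the mean squares and their F-ratios against the error mean square can be recomputed from the d.f. and s.s. columns. The sketch below is illustrative only; the function name `f_ratios` is an assumption, and no significance levels are computed, only the ratios themselves.

```python
ERROR_MS = 15130  # error mean square, 36 d.f.

# (source of variation, d.f., sum of squares) as listed in table 3.5
components = [
    ("Average linear response", 1, 57207),
    ("Average curvature", 1, 31140),
    ("Differences in linear response", 3, 43408),
    ("Differences in curvature", 3, 25693),
]


def f_ratios(rows, error_ms):
    """Return {source: (mean square, F-ratio against the error m.s.)}."""
    result = {}
    for source, df, ss in rows:
        ms = ss / df
        result[source] = (ms, ms / error_ms)
    return result


table = f_ratios(components, ERROR_MS)
for source, (ms, f) in table.items():
    print(f"{source:32s} m.s. = {ms:9.0f}   F = {f:.2f}")

# The four components should add up to the treatments s.s. (8 d.f.)
treatments_ss = sum(ss for _, _, ss in components)
```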


A more extensive discussion of this general topic, with a number of
examples, is given in Snedecor’s book (3.4), chapter 15. It is worth
repeating that the amount and type of subdivision that should be done
depend on the experiment, and to some extent on individual taste. As will
be seen in the next section, any single component can be tested by means
of a t-test derived from the treatment means, and some workers prefer to
make the test in this way.

3.5 Calculation of Standard Errors for Comparisons among
Treatment Means

3.51 Rules. We first give a rule (rule 5) that is more general than
we need, because it is sometimes useful for other purposes.

Rule 5. The standard error of any linear function

    $z = l_1 y_1 + l_2 y_2 + \cdots + l_n y_n$

of the individual observations is

    $\sigma_z = \sigma \sqrt{l_1^2 + l_2^2 + \cdots + l_n^2}$

The estimated standard error is obtained by substituting s for $\sigma$, where s
is the square root of the error m.s. in the analysis of variance. This rule
supplies the standard error of any kind of linear function, provided that
it has been expressed in terms of the individual observations. For linear
functions of the treatment means, one of two special cases of the rule is
used.

Rule 5a. If all treatments have the same number r of replications, the
standard error of a linear function

    $z = l_1 \bar{y}_1 + l_2 \bar{y}_2 + \cdots + l_t \bar{y}_t$

of the treatment means $\bar{y}_i$ is

    $\sigma_z = \dfrac{\sigma}{\sqrt{r}} \sqrt{l_1^2 + l_2^2 + \cdots + l_t^2}$


Rule 5b. If the ith treatment has $r_i$ replications, the standard error of
z is

    $\sigma_z = \sigma \sqrt{\dfrac{l_1^2}{r_1} + \dfrac{l_2^2}{r_2} + \cdots + \dfrac{l_t^2}{r_t}}$

As in the general case, we substitute s for $\sigma$ in order to obtain the
estimated standard error. Finally, a rule that may save labor in complex
cases is as follows.

Rule 6. If the linear functions $z_1, z_2, \cdots, z_p$ are mutually orthogonal,
the standard error of

    $Z = l_1 z_1 + l_2 z_2 + \cdots + l_p z_p$

is

    $\sigma_Z = \sqrt{l_1^2 \sigma_1^2 + l_2^2 \sigma_2^2 + \cdots + l_p^2 \sigma_p^2}$

where $\sigma_i^2$ is the variance of $z_i$.

This rule enables the work to be done in two stages. Sometimes, in a
large experiment, it is relatively easy to express the function z in terms
of a number of familiar orthogonal functions, but rather tedious to write
z in terms of original observations. The variances of the $z_i$ may be found
by rule 5 and substituted in rule 6. Note that different $z_i$ may involve
the same set of observations, provided that the $z_i$ are orthogonal.
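
The three working rules (5a, 5b, and 6) translate directly into short functions. The sketch below is a minimal illustration; the function names are assumptions made for this example and are not part of the text.

```python
import math


def se_equal_reps(coeffs, sigma, r):
    """Rule 5a: every treatment mean is based on the same number r of replications."""
    return sigma / math.sqrt(r) * math.sqrt(sum(l * l for l in coeffs))


def se_unequal_reps(coeffs, sigma, reps):
    """Rule 5b: the ith treatment mean is based on reps[i] replications."""
    return sigma * math.sqrt(sum(l * l / r for l, r in zip(coeffs, reps)))


def se_orthogonal(coeffs, variances):
    """Rule 6: Z = sum of l_i * z_i, the z_i mutually orthogonal with the given variances."""
    return math.sqrt(sum(l * l * v for l, v in zip(coeffs, variances)))


# Difference between two treatment means, 4 replications each, sigma = 1
se_a = se_equal_reps([1, -1], sigma=1.0, r=4)
se_b = se_unequal_reps([1, -1], sigma=1.0, reps=[4, 4])
print(se_a, se_b)
```

With equal replication, rule 5b reduces to rule 5a, which is why the two printed values coincide.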


3.52 Examples. These rules will now be illustrated by application to 
the eelworm experiment. The treatment means are shown in table 3.6,
the figures in parentheses denoting the order in which the treatments are 
numbered. 


TABLE 3.6 Mean numbers of eelworms per plot

                         Fumigant

Level      (CN)      (CS)      (CM)      (CK)      Means

  0                   366(9)                         366
  1       266(5)    232(6)    358(7)    223(8)       270
  2       316(1)    219(2)    310(3)    280(4)       281


From table 3.5 the standard error per plot is $\sqrt{15{,}130}$, or s = 123,
with 36 d.f.

Example 1. Standard error of the average linear response. This is
measured by the average response to the double dressing, i.e.,

    $z = \tfrac{1}{4}(\bar{y}_1 + \bar{y}_2 + \bar{y}_3 + \bar{y}_4) - \bar{y}_9 = -85$

Since the first 4 means are based on 4 replicates, while $\bar{y}_9$ has 16
replicates, we apply rule 5b, which gives

    $\sigma_z = 123 \sqrt{\dfrac{4(1/4)^2}{4} + \dfrac{1}{16}} = \dfrac{123}{\sqrt{8}} = 43.5$

The value of Student’s t is 85/43.5, or 1.95.

As mentioned previously, the t-test of any comparison, as made above,
is identical with the F-test of the corresponding component in the
analysis of variance. The basic relationship is $F = t^2$; that is, if the 5%
values of t for n degrees of freedom are read from the table and squared,
their squares are the 5% F values for 1 and n degrees of freedom, and
similarly for any other significance level. This result may be verified in
the present instance. The value of $t^2$ is 3.80, with 36 d.f. From the
analysis of variance in table 3.5, the F value for the average linear
response is 57,207/15,130 or 3.78, with 1 and 36 d.f., the two values
agreeing apart from rounding errors. The reader may check that the
same agreement holds for the average curvature.
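
The agreement between $t^2$ and F can be seen without rounding by working from the plot totals (4505 for the double dressing and 5858 for the control, each over 16 plots) rather than from the rounded means. The following is a sketch of that check; the variable names are chosen for this illustration only.

```python
import math

ERROR_MS = 15130       # error mean square, 36 d.f. (table 3.5)
double_total = 4505    # total over the 16 double-dressing plots
control_total = 5858   # total over the 16 control plots

# z: mean of the double dressing minus the control mean
z = double_total / 16 - control_total / 16

# Rule 5b: four means with coefficient 1/4 on 4 plots each,
# and one mean with coefficient -1 on 16 plots
se = math.sqrt(ERROR_MS) * math.sqrt(4 * (0.25 ** 2) / 4 + 1 / 16)

t = z / se
f_ratio = 57207 / ERROR_MS

print(f"z = {z:.2f}, s.e. = {se:.1f}, t^2 = {t * t:.3f}, F = {f_ratio:.3f}")
```

Carried out this way, the two values differ only through the rounding of the tabulated sum of squares 57,207.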

Example 2. Response to the single dressing of (CN). The response is
measured by

    $z = \bar{y}_5 - \bar{y}_9$

Once again rule 5b is appropriate.

    $\sigma_z = 123 \sqrt{\dfrac{1}{4} + \dfrac{1}{16}} = \dfrac{123\sqrt{5}}{4} = 68.8$


Example 3. Difference between the linear responses to (CN) and (CS).
In section 3.43 it was shown that this difference is

    $z = \tfrac{1}{5}[\bar{y}_5 - \bar{y}_6 + 2\bar{y}_1 - 2\bar{y}_2]$

Since all means have 4 replicates, rule 5a may be used.

    $\sigma_z = \dfrac{123}{(2)(5)} \sqrt{1 + 1 + 4 + 4} = (12.3)(\sqrt{10}) = 38.9$
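
Examples 2 and 3 can be reproduced numerically as below. This is a sketch under the text's value s = 123; the variable names are assumptions for the illustration.

```python
import math

s = 123  # estimated standard error per plot, 36 d.f.

# Example 2 (rule 5b): y5 is a mean of 4 plots, y9 a mean of 16 plots
se_single_cn = s * math.sqrt(1 / 4 + 1 / 16)

# Example 3 (rule 5a): coefficients of y5, y6, y1, y2, all means of 4 plots
coeffs = [1 / 5, -1 / 5, 2 / 5, -2 / 5]
se_linear_diff = s / math.sqrt(4) * math.sqrt(sum(l * l for l in coeffs))

print(f"{se_single_cn:.1f}  {se_linear_diff:.1f}")
```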


Example 4. Although the experiment is not complex enough to
exhibit a profitable application of rule 6, this example shows that the rule
agrees with the other rules. Suppose that at the double level we had
compared (CN) with (CS), and also the two chemicals with the two
proprietary mixtures. The comparisons are

    $z_1 = \bar{y}_1 - \bar{y}_2; \qquad z_2 = \dfrac{\bar{y}_1 + \bar{y}_2}{2} - \dfrac{\bar{y}_3 + \bar{y}_4}{2}$

By rule 5a their standard errors are

    $\sigma_1 = \dfrac{\sigma}{\sqrt{2}}; \qquad \sigma_2 = \dfrac{\sigma}{2}$

and it will be noted that the two functions are orthogonal.

If now we wished to compare the double dressing of (CN) with the
mean of the 3 other double dressings, the comparison would be

    $z = \bar{y}_1 - \dfrac{\bar{y}_2 + \bar{y}_3 + \bar{y}_4}{3}$

By a direct use of rule 5a, we have

    $\sigma_z = \dfrac{\sigma}{2} \sqrt{1 + \dfrac{1}{9} + \dfrac{1}{9} + \dfrac{1}{9}} = \dfrac{\sigma}{\sqrt{3}}$

But alternatively we may write

    $z = \dfrac{2 z_1}{3} + \dfrac{2 z_2}{3}$

and apply rule 6 to give

    $\sigma_z = \sqrt{\dfrac{4}{9} \cdot \dfrac{\sigma^2}{2} + \dfrac{4}{9} \cdot \dfrac{\sigma^2}{4}} = \dfrac{\sigma}{\sqrt{3}}$

in agreement with the more direct method.
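
The agreement claimed in Example 4 can be checked numerically. The sketch below takes $\sigma = 1$ and r = 4 for convenience; the helper `se_means` is an assumption introduced for this illustration.

```python
import math

sigma, r = 1.0, 4


def se_means(coeffs):
    """Rule 5a for a linear function of means that all have r replications."""
    return sigma / math.sqrt(r) * math.sqrt(sum(l * l for l in coeffs))


# Coefficients of y1, y2, y3, y4 (the four double dressings)
z1 = [1.0, -1.0, 0.0, 0.0]
z2 = [0.5, 0.5, -0.5, -0.5]

# Orthogonality: the sum of products of corresponding coefficients is zero
dot = sum(a * b for a, b in zip(z1, z2))

# z = (2/3) z1 + (2/3) z2 should reproduce y1 - (y2 + y3 + y4)/3
combined = [2 / 3 * a + 2 / 3 * b for a, b in zip(z1, z2)]

direct = se_means([1, -1 / 3, -1 / 3, -1 / 3])
via_rule6 = math.sqrt((2 / 3) ** 2 * se_means(z1) ** 2
                      + (2 / 3) ** 2 * se_means(z2) ** 2)

print(dot, direct, via_rule6)
```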


3.53 Testing Effects Suggested by the Data. In order that F- and
t-tests be valid, the tests to be made in an experiment should be chosen
before the results have been inspected. The reason for this is not hard
to see. If tests are selected after inspection of the data, there is a natural
tendency to select comparisons that appear to give large differences.
Now large apparent differences may arise either because there are large
real effects, or because of a fortuitous combination of the experimental
errors. Consequently, in so far as differences are selected just because
they seem to be large, it is likely that an undue proportion of the cases
selected will be those where the errors have combined to make the
differences large. The extreme case most commonly cited is that of the
experimenter who always tests, by an ordinary t-test, the difference
between the highest and the lowest treatment means. If the number of
treatments is large, this difference will be substantial even when the
treatments produce no real differences in effect. It may be shown that
with 3 treatments the observed value of t will exceed the 5% level in the
table about 13% of the time. With 6 treatments the figure is 40%, with
10 treatments 60%, and with 20 treatments 90%. When the experimenter
thinks that he is making a t-test at the 5% level, he is actually
testing at the 13% level, or the 40% level, and so on. To summarize,
the selection of those differences that look large and therefore
“interesting” invalidates the ordinary tests of significance. The effect is to
obtain too many significant results, or to raise the significance level of
the test from the presumed 5% to some higher level, usually unknown.
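
The inflation of the significance level can be illustrated by a small simulation. The sketch below makes simplifying assumptions that are not in the text: the treatment means are independent normal variables with no real differences, the error variance is treated as known (so the 5% point is taken as 1.96 rather than a tabulated t value), and the function name, seed, and number of trials are arbitrary. It should therefore reproduce the quoted percentages only approximately.

```python
import math
import random


def false_positive_rate(k, trials=20000, seed=0):
    """Share of null experiments in which (highest mean - lowest mean)
    exceeds the ordinary two-sided 5% point for a single difference."""
    rng = random.Random(seed)
    # Each mean has unit variance, so a difference of two means has s.e. sqrt(2)
    threshold = 1.96 * math.sqrt(2)
    hits = 0
    for _ in range(trials):
        means = [rng.gauss(0.0, 1.0) for _ in range(k)]
        if max(means) - min(means) > threshold:
            hits += 1
    return hits / trials


rates = {k: false_positive_rate(k) for k in (3, 6, 10, 20)}
for k, rate in rates.items():
    print(f"{k:2d} treatments: {100 * rate:.0f}% of null experiments 'significant'")
```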

On the other hand, the rule that no tests must be constructed after 
seeing the data seems contrary to sound scientific practice. Often the 
initial experiments in a line of research are conducted in order to “see 
what happens”; the research worker does not know in advance what 
comparisons he may wish to test. And even where hypotheses about 
the nature of the results can be set up, experiments often indicate 
strongly, as is later confirmed, that these hypotheses are erroneous and 
that the method of analysis based on them is inappropriate. Con- 
sequently, although the effects of selection cannot be ignored, they should 
not deter the experimenter from the most careful examination of his 
data. Any difference that is of interest, whether anticipated or not, 
should be tested. If the ordinary tests show that the difference is not 
significant, then at least we know that the difference can be accounted
for without supposing any real effect. If an unexpected difference is 
statistically significant, the possibility of a selection effect should be 
borne in mind. When the point at issue is important, the best pro- 
cedure is to conduct a new experiment specifically designed to confirm 
or disprove the indications from the previous results. 

The reporting of such effects presents difficulties. The following is a 
suggestion. “The experiment also indicated an unexpected effect of 
* * *. Although this was statistically significant at the 5% level, it is
contrary to previous experience, and must at present be regarded with 
caution. Further work on this point will be undertaken.” The diffi- 
culties with this kind of writing are that we do not wish to imply that 
the effect is definitely established, and yet the writing should not en- 
courage the reader to ignore any result that is contrary to our precon- 
ceived notions. 
3.6 Subdivision of the Sum of Squares for Error

3.61 Reasons for Subdivision. The sum of squares for error can be 
partitioned into components in the same way as that for treatments. 
Although such subdivisions are not so frequently required as with treat- 
ments, they have a number of uses. Sometimes there is reason to believe 
that the error s.s. is not homogeneous; that is, the residual errors e do
not all have the same variance $\sigma^2$ as postulated in the mathematical
model. In this case, as will be shown in section 3.63, subdivisions of the
error may be necessary in order to obtain valid t-tests. Occasionally a
gross error in some observation may be suspected, and it is helpful to 
calculate the contribution of this observation to the error s.s. In ad- 
dition, a subdivision may help in understanding the nature of the error 
m.s. 


3.62 Rules for Subdivision. These follow the same general pattern 
as with treatments, though they are a trifle more complicated. It is 
necessary to go back to the normal equations (3.4) from which the treat- 
ment and block effects were estimated. These equations took the form 

    $\sum (y_{ijk} - m - t_i - b_j) = 0$                    (3.4)

In other words, the residuals that provide the estimates of error must
add to zero over any treatment or over any block. Consequently, if the
linear function

    $z = \sum l_{ijk} y_{ijk}$

is to be a component of the error, the coefficients $l_{ijk}$ must sum to zero
over all observations that receive any specified treatment and also over
all that are in any specified block. These are the tests by which we tell
whether any proposed z is part of the error.

For any z which satisfies these tests, the contribution to the error s.s.
is $z^2/D$, where by rule 5

    $D = \sum l_{ijk}^2$

We now examine some components of the error s.s. in the eelworm ex- 
periment. Obviously, the difference z between the eelworm numbers on 
any two control plots in the same block satisfies the conditions. Sim- 
ilarly any comparison among the 4 control observations in a block is a 
component of the error. It follows that the sum of squares of deviations 
of these 4 observations from their mean contributes 3 d.f. to the error 
s.s. The 4 blocks together contribute 12 d.f. of this type of error. The 
sum of squares will be found to be 307,312. This component is a meas- 
ure of the amount of variation among observations that receive the same 
treatment and lie in the same block, and is the type of component that
we should expect to constitute the error. However, this type of
component is encountered only when some treatments have been replicated
within the block.

Consider now the difference between two other treatments, say (1CN)
and (1CS). Let

    z = (1CN)_1 - (1CS)_1

where the subscript 1 denotes that the two observations come from
block 1. This function is not a component of the error, for, while the
coefficients add to zero over every block, they do not do so over
treatments (1CN) and (1CS). But if we take

    (1CN)_1 - (1CS)_1 - (1CN)_2 + (1CS)_2

the conditions are satisfied. This expression shows how the difference
between two treatments changes from one block to another. It is a
measure of the effect of the blocks on the treatment difference, and is
usually called an interaction of treatments with blocks. Such
interactions are the typical components of error.
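
The two tests for an error component (coefficients summing to zero over every treatment and over every block) are easy to mechanize. In the sketch below the representation is an assumption made for illustration: a linear function is a dictionary mapping (treatment, block) to its coefficient, which presumes one plot per treatment per block, so it would need a plot index to cover the replicated controls.

```python
from collections import defaultdict


def is_error_component(coeffs):
    """coeffs maps (treatment, block) -> coefficient of that observation."""
    by_treatment = defaultdict(float)
    by_block = defaultdict(float)
    for (treatment, block), l in coeffs.items():
        by_treatment[treatment] += l
        by_block[block] += l
    sums = list(by_treatment.values()) + list(by_block.values())
    return all(abs(v) < 1e-12 for v in sums)


def divisor(coeffs):
    """D = sum of squared coefficients (rule 5)."""
    return sum(l * l for l in coeffs.values())


# Difference between (1CN) and (1CS) within block 1 only
within_block = {("1CN", 1): 1, ("1CS", 1): -1}

# The same difference compared between blocks 1 and 2
interaction = {("1CN", 1): 1, ("1CS", 1): -1, ("1CN", 2): -1, ("1CS", 2): 1}

print(is_error_component(within_block), is_error_component(interaction))
```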

If $z_i$ is the difference between (1CN) and (1CS) in the ith block, any
comparison among the $z_i$ satisfies the rules required for a component of
the error. Hence the sum of squares of deviations of the $z_i$, when divided
by 2, contributes 3 d.f. to the error s.s. A similar result holds if we take
any comparison among the treatments and calculate it separately for
each block. Since the 9 treatments provide 8 independent comparisons,
we see that the 24 d.f. for the interactions of treatments with blocks may
be divided into 8 sets of 3 d.f. The sum of squares for these 24 d.f. can
of course be found by subtraction as shown in table 3.7.


TABLE 3.7 Subdivision of the sum of squares for error

                           d.f.       s.s.       m.s.

Error                       36     544,690     15,130
Among controls              12     307,312     25,609
Treatments X blocks         24     237,378      9,891


The F-ratio of the two components of error is 2.59, with 12 and 24 d.f. In
the ordinary table this corresponds to a significance level of about 2½%.
It is probably more correct to make a two-tailed test by doubling this
probability. Either test indicates that the controls are more variable
than the other treatments. This is not very surprising, since in data of
this type the variance may tend to increase as the mean increases. In
any event the result throws doubt on the suitability of the original
mathematical model; it would seem better to ascribe different error
variances to the control and the fumigants.
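
The subtraction in table 3.7 and the resulting F-ratio can be verified in a few lines. This is a plain arithmetic sketch; the variable names are chosen for this illustration.

```python
error_ss, error_df = 544690, 36
controls_ss, controls_df = 307312, 12

# Treatments X blocks component, found by subtraction
interaction_ss = error_ss - controls_ss
interaction_df = error_df - controls_df

controls_ms = controls_ss / controls_df
interaction_ms = interaction_ss / interaction_df
f_ratio = controls_ms / interaction_ms

print(f"{controls_ms:.0f}  {interaction_ms:.0f}  F = {f_ratio:.2f}")
```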

The control also contributes to the treatments X blocks component 
of the error. The 8 comparisons among the 9 treatments can be divided 
into 1 between the control and all fumigants, and 7 which are com- 
parisons among the fumigants themselves. The former will contribute 
3 d.f. to the treatments X blocks interaction; the latter, 21 d.f. If the
control is more variable than the fumigants, we would expect the mean
square for the 3 d.f. to be larger than that for the 21 d.f., though it should
be remembered that a mean square with only 3 d.f. is poorly determined.
In the next section these two components will be computed separately. 

3.63 Calculation of a Separate Error for a Treatment Comparison. As
the eelworm example illustrates, certain treatments may be erratic in
their effects, while others in the same experiment show more stable
results. When this occurs, comparisons involving the erratic treatments
have a higher experimental error variance than those among the more
stable treatments. Since the error m.s. in the analysis of variance is some
weighted average of these different variances, its use for individual
t-tests may not be valid. In such cases it is helpful to be able to calculate
a separate error for any specific treatment comparison. The procedure
will be illustrated for the comparison between the control and the
fumigants.

The first step is to compute the comparison separately for each block.

                                     Totals in block

                                1       2       3       4     Total

Control                       1814    1295    1769     980     5858
Fumigants                     2569    1780    2983    1490     8822
2(Control) - (Fumigants)      1059     810     555     470     2894

Since there are 4 control plots and 8 fumigated plots in a block, the
control total is multiplied by 2 in forming the comparison. The
contribution to the error s.s. is

    $\dfrac{(1059)^2 + (810)^2 + (555)^2 + (470)^2}{24} - \dfrac{(2894)^2}{96} = 8862$

The divisor 24 is found by the usual rule. The mean square is 2954,
with 3 d.f. By subtraction, the mean square for the interaction of
fumigants with blocks (21 d.f.) is found to be 10,882. Contrary to
anticipation, the component involving the control has the smaller mean
square, though the difference does not approach significance.
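
The separate error for the control-versus-fumigants comparison can be recomputed as sketched below. The block totals used here are the ones consistent with the comparison values 1059, 810, 555, 470 and with the overall totals 5858 and 8822; the variable names are assumptions for this illustration.

```python
control = [1814, 1295, 1769, 980]      # control totals in blocks 1-4
fumigants = [2569, 1780, 2983, 1490]   # fumigant totals in blocks 1-4

# 2(Control) - (Fumigants), block by block
comparison = [2 * c - f for c, f in zip(control, fumigants)]

# Divisor by rule 5: 4 control plots with coefficient 2, 8 fumigated plots with -1
divisor = 4 * 2 ** 2 + 8 * 1 ** 2

blocks = len(comparison)
separate_ss = (sum(z * z for z in comparison) / divisor
               - sum(comparison) ** 2 / (blocks * divisor))
separate_ms = separate_ss / (blocks - 1)

# Remaining 21 d.f. of the treatments X blocks s.s. (237,378 with 24 d.f.)
fumigants_by_blocks_ms = (237378 - separate_ss) / 21

print(comparison, round(separate_ss), round(separate_ms), round(fumigants_by_blocks_ms))
```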

As often happens when the simple model is inadequate, the best
approach for a more accurate analysis is questionable. It is possible
to compute a separate error, by the method just given, for each t-test
that is made. This is the procedure least open to criticism, since it
makes no assumption that errors are constant from one treatment to
another. Since, however, each t-test would be based on only 3 d.f.,
these tests are insensitive, so that this method should not be used
without good reason. In the present case it is probably justifiable to
postulate only two error variances. The first, appropriate to the controls,
is estimated by the mean square 25,609, with 12 d.f.; the second, for the
fumigants, by the mean square 10,882, with 21 d.f. By this approach
the variance of the mean response to the double dressing, for example,
is estimated as

    $\dfrac{10{,}882}{16} + \dfrac{25{,}609}{16} = 2281$

since each mean is taken over 16 plots.

Separation of the error s.s. into single components is seldom required
except in special studies or as an exercise. One method is to set up 8
orthogonal comparisons among the treatments. Calculate each
comparison separately in each block, and let $z_{ij}$ be the value obtained for the
ith comparison (i = 1, 2, ..., 8) in the jth block. Then find

    $w_{i1} = z_{i1} - z_{i2}$
    $w_{i2} = z_{i1} + z_{i2} - 2 z_{i3}$
    $w_{i3} = z_{i1} + z_{i2} + z_{i3} - 3 z_{i4}$

Each w is a component of the error, and the whole 24 w’s are mutually
orthogonal. The squares of the w’s, each with its proper divisor, give a
separation of the error s.s. into single components.
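
For one comparison, the three w’s can be formed and checked as in the sketch below, which uses the block values 1059, 810, 555, 470 of the control-versus-fumigants comparison. The function name is an assumption. The divisor computed here covers only the coefficients of the z’s; to obtain the contribution to the error s.s., each term would also be divided by the divisor of the underlying comparison (24 for this one).

```python
def single_components(z):
    """Return [(w, divisor)] for w_k = z_1 + ... + z_k - k * z_(k+1)."""
    components = []
    for k in range(1, len(z)):
        w = sum(z[:k]) - k * z[k]
        components.append((w, k + k * k))  # k coefficients of 1 and one of -k
    return components


z = [1059, 810, 555, 470]  # one comparison, computed in each of the 4 blocks
components = single_components(z)
single_ss = [w * w / d for w, d in components]

# The three single components together reproduce the sum of squares
# of deviations of the z's from their mean (3 d.f.)
deviation_ss = sum(v * v for v in z) - sum(z) ** 2 / len(z)

print(components, sum(single_ss), deviation_ss)
```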

3.7 Missing Data 

3.71 Method of Handling Missing Data in the Analysis. From time 
to time certain observations are missing, through failure to record, gross 
errors in recording, or accidents. The omissions naturally affect the 
method of analysis. With each of the common designs we give computa- 
tional instructions for analysing data that contain gaps. The object of 
this section is to indicate the theoretical basis for these methods. 

When certain observations are absent, the correct procedure is to
write down a mathematical model for all observations that are present.
The least squares normal equations are then constructed in the usual
way. These take exactly the same general form as when all
observations are present; i.e.,

    $\sum (y_{ijk} - m - t_i - b_j) = 0$                    (3.20)

over all observations whose equations contain any specified parameter
that is to be estimated. Since, however, the terms in the equation
corresponding to missing observations are absent, the system of equations
loses some of the symmetry that it possesses when all observations are
present, and the solutions are more difficult. The same general procedure
supplies F- and t-tests of hypotheses about the nature of the treatment
effects as described in section 3.24, though again the details become more
complicated.

In the analysis of variance, two changes may be noted. Owing to
the missing observations, the treatments and blocks s.s. become
entangled, so that the treatments s.s. must be computed after allowing
for block effects, as mentioned at the end of section 3.26. Secondly,
if a observations are absent, the total number of degrees of freedom
is reduced by a. Unless one or more complete treatments or blocks
is missing, the number of parameters required to describe these effects
will be the same as before. Consequently, the missing degrees of
freedom all come from the error s.s., which now represents $(n_e - a)$ degrees
of freedom. In short, missing data may be handled by applying the
standard least squares procedure to all observations that are not
missing. For future reference, this method will be called the “correct least
squares procedure.”

To the experimenter it may be a difficult business to carry out the 
construction and solution of a set of unfamiliar normal equations, even 
though he is quite competent to analyze a set of complete data. For this 
reason Yates (3.9), following a suggestion by Fisher, considered inserting 
values for the missing observations so as to obtain a set of complete data. 
Suppose that only a single observation is missing, and that a value x is 
substituted for this observation. If the analysis of variance is calculated 
in the usual manner for complete data, the error s.s. is found to be of the 
form 

Ax 2 - 2Bx + C 

where A, B, and C are numbers that are determined by the type of de- 
sign and the values for the other observations (A is always positive). 

In order to find a numerical value for x, Yates proposed to use the
value that minimizes the error s.s. This is x = B/A. If this value is
inserted in place of the missing observation, and if the data are analyzed
as if no observations were absent, Yates showed that several important
properties hold. (i) The estimates of treatment and block effects are
exactly the same as those obtained by the correct least squares
procedure. (ii) The error s.s. is exactly the same as given by the correct
procedure. (iii) To obtain the correct partition of the degrees of freedom,
we subtract 1 d.f. from the total and 1 d.f. from the error.
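
The quadratic form of the error s.s. and the minimizing value x = B/A can be seen numerically. The sketch below uses a small randomized block layout with invented yields (the data are hypothetical, chosen only for illustration, and the function names are assumptions). It evaluates the error s.s. at three trial values of x to recover A, B, and C, and then forms B/A.

```python
def error_ss(table):
    """Error s.s. of a randomized block layout; table[i][j] = treatment i in block j."""
    t, b = len(table), len(table[0])
    grand = sum(sum(row) for row in table)
    correction = grand ** 2 / (t * b)
    total_ss = sum(v * v for row in table for v in row) - correction
    treatments_ss = sum(sum(row) ** 2 for row in table) / b - correction
    blocks_ss = sum(sum(table[i][j] for i in range(t)) ** 2
                    for j in range(b)) / t - correction
    return total_ss - treatments_ss - blocks_ss


# Hypothetical yields; the observation for treatment 0 in block 1 is missing
data = [[12.0, None, 15.0],
        [10.0, 11.0, 14.0],
        [9.0, 13.0, 16.0]]


def f(x):
    """Error s.s. of the 'completed' data when x is inserted for the missing value."""
    filled = [row[:] for row in data]
    filled[0][1] = x
    return error_ss(filled)


# f(x) = A x^2 - 2 B x + C, so three evaluations determine A, B, and C
C = f(0.0)
A = (f(1.0) + f(-1.0)) / 2 - C
B = (f(-1.0) - f(1.0)) / 4
x_hat = B / A

print(f"A = {A:.4f}, B = {B:.4f}, inserted value = {x_hat:.2f}")
```

Any other inserted value gives a larger error s.s., which is the sense in which B/A is the "estimate" of the missing observation.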

Yates also showed that the method of insertion fails to agree with the
correct least squares procedure in two respects. The treatments s.s., as
obtained in the analysis of variance of the “complete” data, is always
slightly larger than the correct treatments s.s. for an F-test of the
treatments. Unless an appreciable fraction of the total observations is
missing, this overestimation is unlikely to be large; further, the exact F-test
can be obtained by means of some additional calculations. The second
defect of the method of insertion is that it may not give proper t-tests.
That this will happen is clear, because in the analysis of “complete”
data r replications are ascribed to the treatment that contains the missing
observation, whereas there are only (r - 1) replications. To allow for
this disturbance we give special rules which provide t-tests that are
approximately correct.

Thus the method adopted with a missing value is first to “estimate”
this value by means of the formula B/A, which will be presented for the
common designs. This estimate is used in place of the missing value, and
the rest of the analysis is conducted as if the data were complete (except
for the changes in degrees of freedom). Special methods are available
for exact F-tests and for t-tests. If several observations are absent, a
repeated application of the formula enables values to be substituted for
each missing observation.

This method is essentially an ingenious computational device whose 
purpose is to enable the easy computations that apply to complete data 
to be used even when data are incomplete. Substitution of estimates 
for the missing data does not in any way recover the information that is 
lost through loss of data, as some experimenters have suggested, usually 
facetiously; it merely attempts to reproduce the results obtained by an 
application of the least squares method to the data that are present. 
The only complete solution of the “missing data” problem is not to have 
them. 

3.8 The Analysis of Covariance

3.81 Purpose of the Technique. As indicated in section 3.2,
experiments can be planned so that certain types of environmental effect are
eliminated from the estimates of the treatment effects, with the result
that these estimates are made more accurate. In the eelworm
experiment, where the plots were grouped into blocks of 12 plots each, any
differences from block to block in the severity of eelworm infestation were
eliminated in this way. On the other hand, differences in infestation
from plot to plot within the same block are not controlled by the design and
do contribute to the experimental errors, since treatments were assigned
at random to plots within each block. Accordingly, before the fumigants
were applied, samples were taken in order to estimate the natural
infestation on each plot.

The analysis of covariance shows how to use these supplementary data 
to reduce the experimental errors by eliminating the effects of variations 
in the initial infestation within a block. The technique is potentially 
very useful. It often happens that some source of variation which can- 
not be controlled by the design can be measured by taking additional 
observations. Whenever this is so, the analysis of covariance can be 
utilized, often to great advantage. One caution is that, since the ad- 
ditional observations are to measure environmental effects, they must not 
be influenced by the treatments. The situation where such measurements
are influenced by the treatments is discussed in section 3.88.

3.82 Initial Steps in the Analysis. The first step is to construct a
new mathematical model. If $y_{ijk}$ refers to the final eelworm count and
$x_{ijk}$ to the initial, the relation is

    $y_{ijk} = \mu + \tau_i + \beta_j + \gamma(x_{ijk} - \bar{x}) + e_{ijk}$          (3.21)

The only change is the introduction of a new term to describe the effect
of the initial eelworm number. We have assumed that the effect is
linear, i.e., it is a constant multiple $\gamma$ of the amount by which the initial
eelworm number $x_{ijk}$ on the plot differs from the average initial number
$\bar{x}$ for the whole experiment.

As before, the unknowns are estimated by least squares. In this
case we minimize

    $\sum [y_{ijk} - m - t_i - b_j - c(x_{ijk} - \bar{x})]^2$          (3.22)

The normal equation for m is again of the form

    $\sum [y_{ijk} - m - t_i - b_j - c(x_{ijk} - \bar{x})] = 0$          (3.23)

over all observations. If the same linear restrictions as before are
applied to the $t_i$ and the $b_j$, equation (3.23) implies that m is the mean of the
$y_{ijk}$. The equation for $t_i$ is the same as (3.23) except that the sum is over
those observations that receive $t_i$. The equation may be rearranged as

    $r_i m + r_i t_i = T_{iy} - c(T_{ix} - r_i \bar{x})$          (3.24)

where $r_i$ is the number of replications, and T denotes a treatment total.
This gives

    $m + t_i = \dfrac{T_{iy}}{r_i} - c\left(\dfrac{T_{ix}}{r_i} - \bar{x}\right)$          (3.25)

The important feature of this equation is that in order to obtain $t_i$, the
observed treatment mean of the y’s ($T_{iy}/r_i$) is adjusted, the adjustment
depending on the treatment mean of the x’s. It is this adjustment that
removes the effect of the initial infestation.

The equation for c, the adjustment factor, will not be developed in
full. It turns out that c is given by the ratio of the error sum of products
of y and x to the error s.s. of x.

3.83 Computations. In practice we start with a joint analysis of the
sums of squares and products of y and x. In order to illustrate certain
features of the covariance technique, the treatments s.s. will be
subdivided into “linear” and “curvature” components, as was done for y
in table 3.5 (p. 64). The original values for x are given above the y
values in table 3.1 (p. 40). The treatment totals and the quantities
needed for isolating the linear response and the curvature are reproduced
in table 3.8. The analysis of the sum of squares of x follows by the
same methods as given previously for y.

TABLE 3.8 Treatment totals for numbers of cysts *

                    Before fumigation (x)                 After fumigation (y)

Level               CN     CS     CM     CK  Total        CN     CS     CM     CK  Total

0                          1975               1975               5858               5858
1                  402    417    513    570   1902      1066    928   1431    892   4317
2                  389    554    568    778   2289      1265    877   1241   1122   4505
2(2) + (1): L     1180   1525   1649   2126   6480      3596   2682   3913   3136  13,327
2(1) - (2): Q      415    280    458    362   1515       867    979   1621    662   4129

* Numbers of cysts per plot were the numbers found in 400 grams of soil. Thus
the totals in the table represent 1600 grams of soil, except for the no-fumigant totals,
which represent 6400 grams.

To analyze the sum of products, we carry out the same operations as 
for a sum of squares, except that at every stage a square is replaced by 
the corresponding product. A few examples will suffice. 
Total (the sum of products xy being taken over the 48 plots):

    $\sum xy - \dfrac{(6166)(14{,}680)}{48} = 355{,}929$

Treatments:

    $\dfrac{(1975)(5858)}{16} + \dfrac{(402)(1066)}{4} + \cdots + \dfrac{(778)(1122)}{4} - \dfrac{(6166)(14{,}680)}{48} = -9222$

Average linear response:

    $\dfrac{(2289 - 1975)(4505 - 5858)}{32} = -13{,}276$


The complete analysis of covariance is shown in table 3.9. 


TABLE 3.9 Sums of squares and products (x = before, y = after
fumigation)

                               d.f.      (x²)        (xy)        (y²)

Blocks                           3     159,618     175,873     289,427
Treatments                       8      29,142      -9,222     157,448
  Average linear response        1       3,081     -13,276      57,207
  Average curvature              1       2,204       8,285      31,140
  Differences in linear          3      22,975      -6,837      43,408
  Differences in curvature       3         882       2,606      25,693
Error                           36     121,408     189,278     544,690
Total                           47     310,168     355,929     991,565
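
Two entries of the (xy) column can be recomputed from the treatment totals of table 3.8, following the rule that each square in a sum-of-squares calculation is replaced by the corresponding product. The sketch below is illustrative; the function name `sum_of_products` is an assumption.

```python
def sum_of_products(x_totals, y_totals, plots, grand_x, grand_y, n):
    """Sum of products for a set of totals, each based on plots[i] plots."""
    uncorrected = sum(x * y / p for x, y, p in zip(x_totals, y_totals, plots))
    return uncorrected - grand_x * grand_y / n


# Control first, then the single dressings (CN, CS, CM, CK), then the double
x_totals = [1975, 402, 417, 513, 570, 389, 554, 568, 778]
y_totals = [5858, 1066, 928, 1431, 892, 1265, 877, 1241, 1122]
plots = [16] + [4] * 8

grand_x, grand_y = sum(x_totals), sum(y_totals)

treatments_sp = sum_of_products(x_totals, y_totals, plots, grand_x, grand_y, 48)

# Average linear response: (double-level total - control total) for x and y,
# each total being over 16 plots, so the divisor is 16 * (1 + 1) = 32
average_linear_sp = (2289 - 1975) * (4505 - 5858) / 32

print(round(treatments_sp), round(average_linear_sp))
```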


If we denote the sums of squares and products for error by $E_{xx}$, $E_{yx}$,
and $E_{yy}$, respectively, the adjustment factor c, or regression coefficient
of y on x, is given by

    $c = \dfrac{E_{yx}}{E_{xx}} = \dfrac{189{,}278}{121{,}408} = 1.559024$

It may seem surprising that the coefficient is greater than 1. This is
probably explained by the great seasonal increase in the eelworm
numbers, obvious from table 3.8.
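
The adjustment factor, and the way equation (3.25) uses it, can be sketched as follows. The adjusted means computed here are only an illustration of the formula applied to the totals of table 3.8; they are not values quoted in this passage, and the function name is an assumption.

```python
E_xx = 121408   # error s.s. of x
E_yx = 189278   # error sum of products of y and x

c = E_yx / E_xx

grand_mean_x = 6166 / 48   # average initial count per plot over the experiment


def adjusted_mean(total_y, total_x, r):
    """Equation (3.25): the observed y mean adjusted for the treatment's x mean."""
    return total_y / r - c * (total_x / r - grand_mean_x)


# Single dressing of CN (4 plots) and the control (16 plots)
adjusted_1cn = adjusted_mean(1066, 402, 4)
adjusted_control = adjusted_mean(5858, 1975, 16)

print(f"c = {c:.6f}")
print(f"adjusted means: 1CN = {adjusted_1cn:.1f}, control = {adjusted_control:.1f}")
```

Both treatments started with initial counts below the experiment average, so both adjusted means lie above the unadjusted means.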

The residual error s.s. may now be found. The original error s.s. is
544,690, with 36 d.f. To remove the effect of the regression on the
initial eelworm numbers, we subtract

    $\dfrac{E_{yx}^2}{E_{xx}} = \dfrac{(189{,}278)^2}{121{,}408} = 295{,}089$

    $E_{yy} - \dfrac{E_{yx}^2}{E_{xx}} = 544{,}690 - 295{,}089 = 249{,}601$

Thus the residual s.s. is 249,601, with 35 d.f., since 1 d.f. must be sub-
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where r, is the number of replications, and T denotes a treatment total. 
This gives 

w + (3.25) 

n \ Vi / 

The important, feature of this equation is that in order to obtain U > the 
observed treatment mean of the y’s (Tjy/rj) is adjusted, the adjustment, 
depending on the treatment mean of the ar’s. It is this adjustment that 
removes the effect of the initial infestation. 

The equation for c, the adjustment factor, will not be developed in 
full. It turns out that e is given by the ratio of the error sum of products 
of y and x to the error s.s. of x. 

3.83 Computations. In practice we start with a joint analysis of the
sums of squares and products of y and x. In order to illustrate certain
features of the covariance technique, the treatments s.s. will be
subdivided into “linear” and “curvature” components, as was done for y
in table 3.5 (p. 64). The original values for x are given above the y
values in table 3.1 (p. 40). The treatment totals and the quantities
needed for isolating the linear response and the curvature are reproduced
in table 3.8. The analysis of the sum of squares of x follows by the
same methods as given previously for y.


TABLE 3.8 Treatment totals for numbers of cysts *

                     Before fumigation (x)              After fumigation (y)
Level             CN    CS    CM    CK  Total       CN    CS    CM    CK  Total
0                                        1975                              5858
1                402   417   513   570   1902     1066   928  1431   892   4317
2                389   554   568   778   2289     1265   877  1241  1122   4505
2(2) + (1): L   1180  1525  1649  2126   6480     3596  2682  3913  3136  13327
2(1) - (2): Q    415   280   458   362   1515      867   979  1621   662   4129

* Numbers of cysts per plot were the numbers found in 400 grams of soil. Thus
the totals in the table represent 1600 grams of soil, except for the no-fumigant totals,
which represent 6400 grams.

To analyze the sum of products, we carry out the same operations as 
for a sum of squares, except that at every stage a square is replaced by 
the corresponding product. A few examples will suffice. 
Total:

$$\cdots + (48)(208) - \frac{(6166)(14{,}680)}{48} = 355{,}929$$

Treatments:

$$\frac{(1975)(5858)}{16} + \frac{(402)(1066)}{4} + \cdots + \frac{(778)(1122)}{4} - \frac{(6166)(14{,}680)}{48} = -9222$$

Average linear response:

$$\frac{(2289 - 1975)(4505 - 5858)}{32} = -13{,}276$$
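The rule of replacing each square by the corresponding product can be checked numerically. The following is a minimal sketch, assuming the treatment totals of table 3.8; the variable names are illustrative and are not part of the original text.

```python
# Treatment totals from table 3.8: (replicates, total of x, total of y).
totals = [
    (16, 1975, 5858),                                               # no fumigant
    (4, 402, 1066), (4, 417, 928), (4, 513, 1431), (4, 570, 892),   # single dressing
    (4, 389, 1265), (4, 554, 877), (4, 568, 1241), (4, 778, 1122),  # double dressing
]

n_plots = sum(r for r, _, _ in totals)    # 48 plots in all
g_x = sum(tx for _, tx, _ in totals)      # grand total of x
g_y = sum(ty for _, _, ty in totals)      # grand total of y

# Treatments sum of products: (x total)(y total)/replicates, less the correction term.
sp_treatments = sum(tx * ty / r for r, tx, ty in totals) - g_x * g_y / n_plots

# The same operation with y replaced by x gives the treatments s.s. of x.
ss_x_treatments = sum(tx * tx / r for r, tx, _ in totals) - g_x ** 2 / n_plots

# Average linear response: double dressing minus control, 16 plots in each total.
sp_linear = (2289 - 1975) * (4505 - 5858) / 32

print(round(sp_treatments), round(ss_x_treatments), round(sp_linear))
```

The three printed figures can be compared with the treatments and average linear response lines of table 3.9.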


The complete analysis of covariance is shown in table 3.9. 


TABLE 3.9 Sums of squares and products (x = before, y = after,
fumigation)

                            d.f.      (x²)       (yx)       (y²)
Blocks                        3     159,618    175,873    289,427
Treatments                    8      29,142     -9,222    157,448
  Average linear response     1       3,081    -13,276     57,207
  Average curvature           1       2,204      8,285     31,140
  Differences in linear       3      22,975     -6,837     43,408
  Differences in curvature    3         882      2,606     25,693
Error                        36     121,408    189,278    544,690
Total                        47     310,168    355,929    991,565


If we denote the sums of squares and products for error by $E_{xx}$, $E_{yx}$,
and $E_{yy}$, respectively, the adjustment factor c, or regression coefficient
of y on x, is given by

$$c = \frac{E_{yx}}{E_{xx}} = \frac{189{,}278}{121{,}408} = 1.559024$$


It may seem surprising that the coefficient is greater than 1. This is 
probably explained by the great seasonal increase in the eelworm num- 
bers, obvious from table 3.8. 

The residual error s.s. may now be found. The original error s.s. is
544,690, with 36 d.f. To remove the effect of the regression on the
initial eelworm numbers, we subtract

$$\frac{E_{yx}^2}{E_{xx}} = \frac{(189{,}278)^2}{121{,}408} = 295{,}089$$

so that

$$E_{yy} - \frac{E_{yx}^2}{E_{xx}} = 544{,}690 - 295{,}089 = 249{,}601$$

Thus the residual s.s. is 249,601, with 35 d.f., since 1 d.f. must be
subtracted for the additional parameter γ. The residual m.s. is
249,601/35, or 7131. This is less than half the original mean square
of 15,130, and indicates that the use of covariance has approximately
doubled the accuracy of the experiment.
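The arithmetic of the last few paragraphs is short enough to verify directly. This sketch assumes only the error line of table 3.9; the variable names are ours.

```python
# Error line of table 3.9.
e_xx, e_yx, e_yy = 121_408, 189_278, 544_690
error_df = 36

c = e_yx / e_xx                   # adjustment factor (regression of y on x)
reduction = e_yx ** 2 / e_xx      # s.s. removed by the regression on x
residual_ss = e_yy - reduction
residual_df = error_df - 1        # one d.f. is spent on the regression coefficient
residual_ms = residual_ss / residual_df
original_ms = e_yy / error_df     # error m.s. without covariance

print(round(c, 6), round(reduction), round(residual_ss), round(residual_ms), round(original_ms))
```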

One feature of the covariance method deserves comment at this point. 
In the model the effect of x was assumed to be linear, but no assumption 
was made about the strength of the effect. That is left to be determined 
by the data. If the eelworm numbers on each plot had remained
unchanged throughout the season, except for the influence of the
treatments, and if the eelworm numbers had been accurately measured, the
residual s.s. would have been zero. The residual s.s. as found presumably 
represents a contribution due to seasonal variations in infestation and 
one due to the fact that the eelworm numbers were not estimated ac- 
curately, being obtained only from small samples of soil. If the variable 
x had had no linear effect on y, the residual m.s. would have been the 
same as the original mean square, apart from sampling fluctuations. 

3.84 The Adjusted Treatment Means. A little time is saved if the
adjusted treatment totals are found first. From equation (3.24) the
adjustment to the ith treatment total of y is

$$-c\,(T_{ix} - r_i\bar{x})$$

where $r_i$ is the number of replicates, and $\bar{x}$ the general mean of x (128.46).
Thus for the control the adjusted total is

$$5858 - 1.5590(1975 - 16 \times 128.46) = 5858 + 125 = 5983$$

and for (1CN)

$$1066 - 1.5590(402 - 4 \times 128.46) = 1066 + 174 = 1240$$

The means are shown in table 3.10.

TABLE 3.10 Adjusted treatment means (eelworms per 400 gm. soil)

Level of              Fumigant
dressing       CN     CS     CM     CK
0                     374
1             310    270    358    201
2             365    204    289    178
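The whole of table 3.10 can be regenerated from the totals of table 3.8. The sketch below assumes c = 1.5590 as in the worked examples; the treatment labels (1CN, 2CS, and so on) are used here only as dictionary keys.

```python
c = 1.5590            # adjustment factor, rounded as in the text
x_bar = 6166 / 48     # general mean of x, about 128.46

# (replicates, total of x, total of y) from table 3.8.
totals = {
    "control": (16, 1975, 5858),
    "1CN": (4, 402, 1066), "1CS": (4, 417, 928),
    "1CM": (4, 513, 1431), "1CK": (4, 570, 892),
    "2CN": (4, 389, 1265), "2CS": (4, 554, 877),
    "2CM": (4, 568, 1241), "2CK": (4, 778, 1122),
}

def adjusted_mean(r, t_x, t_y):
    """Adjust the y total by -c(T_x - r * x_bar), then divide by the replicates."""
    return (t_y - c * (t_x - r * x_bar)) / r

adjusted = {name: round(adjusted_mean(*row)) for name, row in totals.items()}
print(adjusted)
```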


3.85 t-tests. These tests are slightly complicated by the fact that
account must be taken of the sampling error of the adjustment factor c.
The difference between two adjusted means may be written

$$\bar{y}_1 - \bar{y}_2 - c(\bar{x}_1 - \bar{x}_2)$$

From regression theory the variance of this quantity is given by

$$\sigma_{y\cdot x}^2\left[\frac{1}{r_1} + \frac{1}{r_2} + \frac{(\bar{x}_1 - \bar{x}_2)^2}{E_{xx}}\right] \qquad (3.26)$$

where $\sigma_{y\cdot x}^2$ is the residual error variance, and $E_{xx}$ the error s.s. for x. As
an estimate of $\sigma_{y\cdot x}^2$ we use the residual error m.s., in this case 7131 with
35 d.f.

Example. A t-test of the reduction due to the double dressing of CS. The
reduction is (374 - 204), or 170 eelworms per sample. The estimated
variance is

$$(7131)\left[\frac{1}{16} + \frac{1}{4} + \frac{(123.44 - 138.50)^2}{121{,}408}\right] = (7131)(0.31437) = 2242 \qquad (3.27)$$

the means of the x's being found by division from table 3.8. Hence

$$t = \frac{170}{\sqrt{2242}} = \frac{170}{47.35} = 3.59$$

with 35 d.f., which is highly significant.
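Since every comparison needs its own variance, a small helper is convenient. This is a sketch of expression (3.26) with the residual m.s. substituted for the variance; the function name is ours.

```python
import math

residual_ms = 7131     # residual error m.s., 35 d.f.
e_xx = 121_408         # error s.s. of x

def variance_of_difference(r1, r2, x_mean_1, x_mean_2):
    """Estimated variance of the difference between two adjusted means, as in (3.26)."""
    return residual_ms * (1 / r1 + 1 / r2 + (x_mean_1 - x_mean_2) ** 2 / e_xx)

# Control (16 plots, x total 1975) against the double dressing of CS (4 plots, x total 554).
variance = variance_of_difference(16, 4, 1975 / 16, 554 / 4)
t = 170 / math.sqrt(variance)

print(round(variance), round(t, 2))
```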

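For a more complicated comparison of the adjusted means,

$$\sum l_i\left[\bar{y}_i - c(\bar{x}_i - \bar{x})\right]$$

the estimated variance is

$$s_{y\cdot x}^2\left[\sum\frac{l_i^2}{r_i} + \frac{\left(\sum l_i\bar{x}_i\right)^2}{E_{xx}}\right] \qquad (3.28)$$

(The term in $\bar{x}$ vanishes because for any comparison the sum of the l's
must be zero.)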

One annoying feature of these tests is that the x values enter into the
variance, so that every comparison necessitates a separate computation
of the variance. As a time-saving approximation, Finney (3.10) has
proposed that an average value for the contribution of the term in the
x's may be used. This amounts to using

$$s_{y\cdot x}^2\left[1 + \frac{t_{xx}}{E_{xx}}\right] = (7131)\left[1 + \frac{3643}{121{,}408}\right] = (7131)(1.030) = 7345$$

as the effective residual error m.s., where $t_{xx}$ is the treatments mean square
for x. The term in brackets represents the average contribution from
the x's, or in other words the contribution from sampling errors in the
adjustment factor c.
Thus for the difference between the adjusted means of the control and
2CS we would use

$$(7345)\left(\frac{1}{r_1} + \frac{1}{r_2}\right) = (7345)\left(\frac{1}{16} + \frac{1}{4}\right) = 2295$$

instead of the value 2242 given by the more exact expression (3.27).

Similarly, instead of (3.28), we would use

$$(7345)\sum\frac{l_i^2}{r_i}$$

This approximation is good enough if the number of error degrees of
freedom exceeds [figure illegible in the source], because in such cases
the contribution from errors in c is small. [A further sentence, stating
a condition under which the method may be used, is illegible in the
source.]
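The effective residual error m.s. and its use for the control versus 2CS comparison can be reproduced as follows. This is a sketch under the figures of tables 3.9 and 3.8; the names are illustrative.

```python
residual_ms = 7131
e_xx = 121_408
t_xx = 29_142 / 8      # treatments mean square for x (8 d.f.)

# One averaged allowance for the sampling error of c, used for every comparison.
effective_ms = residual_ms * (1 + t_xx / e_xx)

# Control (16 plots) against 2CS (4 plots).
approximate_variance = effective_ms * (1 / 16 + 1 / 4)

# The exact variance from (3.27), for comparison.
exact_variance = residual_ms * (1 / 16 + 1 / 4 + (1975 / 16 - 554 / 4) ** 2 / e_xx)

print(round(effective_ms), round(approximate_variance), round(exact_variance))
```

For this particular pair the approximation is slightly larger than the exact value, because the two x means happen to lie closer together than the average pair.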

3.86 F-tests. The fact that the same adjustment factor appears in
every treatment mean also influences the F-test. The procedure is shown
in table 3.11.

TABLE 3.11 F-test in the analysis of covariance

                                                  |        Residuals
             d.f.     (x²)       (yx)      (y²)   |  d.f.     s.s.      m.s.
Treatments     8     29,142     -9,222   157,448  |    8    237,192   29,649
Error         36    121,408    189,278   544,690  |   35    249,601    7,131
T + E         44    150,550    180,056   702,138  |   43    486,793

The figures to the left of the vertical line are from the previous analysis.
A new line, T + E, is formed by adding the treatments and error lines.
For this line the residual s.s. is found as before:

$$702{,}138 - \frac{(180{,}056)^2}{150{,}550} = 486{,}793$$

The process is repeated for the error line; actually, this has already been
done. The residual s.s. for treatments is obtained by subtracting that for
error from that for T + E:

$$237{,}192 = 486{,}793 - 249{,}601$$

It always has the same number of degrees of freedom as the original
treatments s.s. The F-test of the adjusted treatment means is given by
the ratio of the residual m.s.,

$$\frac{29{,}649}{7{,}131} = 4.16$$

with 8 and 35 d.f.
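The steps of table 3.11 reduce to one small function applied twice. The sketch below assumes the treatments and error lines of table 3.9; the function name is ours.

```python
def residual_ss(s_xx, s_yx, s_yy):
    """Sum of squares of y after removing the regression on x."""
    return s_yy - s_yx ** 2 / s_xx

treatments = (29_142, -9_222, 157_448)    # (x^2), (yx), (y^2), 8 d.f.
error = (121_408, 189_278, 544_690)       # 36 d.f.

# Add the two lines, take residuals, then subtract the error residual.
t_plus_e = tuple(t + e for t, e in zip(treatments, error))
res_t_plus_e = residual_ss(*t_plus_e)
res_error = residual_ss(*error)
res_treatments = res_t_plus_e - res_error

f_ratio = (res_treatments / 8) / (res_error / 35)

print(t_plus_e, round(res_t_plus_e), round(res_treatments), round(f_ratio, 2))
```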

If it is desired to test some component of the treatments s.s., the same
calculation is made with the component in place of the treatments s.s.
throughout. If several components are to be tested, this becomes rather
tedious. A useful approximation is to construct from the original
analysis of covariance in table 3.9 an analysis of (y - cx)². This is most
easily done by multiplying each term in (x²) by c², or 2.43048, each term
in (yx) by -2c, or -3.11805, and adding the two products to (y²). The
results are shown in table 3.12.


TABLE 3.12 Analysis of variance of (y - cx)

                            d.f.      s.s.       m.s.       F       F'
Treatments                    8     257,032     32,129
  Average linear response     1     106,090    106,090    14.88    14.51
  Average curvature           1      10,664     10,664     1.50     1.47
  Differences in linear       3     120,566     40,189     5.64     5.05
  Differences in curvature    3      19,711      6,570     0.92     0.92
Error                        35     249,592      7,131




It will be noted that this calculation gives the correct residual error
m.s. However, sums of squares for components of the treatments are
always larger than those given by the more roundabout correct
procedure. Thus the F values shown on the right are all too large. The
overestimation is seldom great, and the F values serve for a preliminary
inspection. Those that are just beyond the significance level may be
recomputed by the correct procedure if it is thought worth while. For
comparison, the F' values shown above are those obtained by the
correct procedure (it is a useful exercise to check them). With either
F or F' there is a significant average linear response and significant
differences among fumigants in their linear responses, while the
curvature terms do not approach significance. In the analysis made without
covariance (table 3.5, p. 64) no component was significant.
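For readers who take up the exercise of checking F', both columns can be computed from table 3.9. This is a sketch: it uses the unrounded value of c, so individual sums of squares may differ from table 3.12 in the last figure, and the function names are ours.

```python
# Lines of table 3.9: (d.f., x^2, yx, y^2).
error = (36, 121_408, 189_278, 544_690)
components = {
    "average linear response":  (1, 3_081, -13_276, 57_207),
    "average curvature":        (1, 2_204, 8_285, 31_140),
    "differences in linear":    (3, 22_975, -6_837, 43_408),
    "differences in curvature": (3, 882, 2_606, 25_693),
}

c = error[2] / error[1]

def residual_ss(s_xx, s_yx, s_yy):
    return s_yy - s_yx ** 2 / s_xx

res_error = residual_ss(*error[1:])
res_ms = res_error / (error[0] - 1)

def approximate_f(df, s_xx, s_yx, s_yy):
    """F from the analysis of (y - cx): s.s. = (y^2) + c^2 (x^2) - 2c (yx)."""
    ss = s_yy + c * c * s_xx - 2 * c * s_yx
    return (ss / df) / res_ms

def exact_f(df, s_xx, s_yx, s_yy):
    """F' by the correct procedure: residual of (component + error) less residual of error."""
    combined = residual_ss(s_xx + error[1], s_yx + error[2], s_yy + error[3])
    return ((combined - res_error) / df) / res_ms

f_values = {name: (approximate_f(*row), exact_f(*row)) for name, row in components.items()}
for name, (f, f_exact) in f_values.items():
    print(f"{name:26s} F = {f:5.2f}   F' = {f_exact:5.2f}")
```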

3.87 The Increase in Accuracy Due to Covariance. From table 3.9
we see that with no covariance the error s.s. for y is 544,690, with 36
d.f., giving an error m.s. of 15,130. A comparable figure when covariance
is used is the effective residual m.s., 7345. This is preferable to the
residual m.s. itself (7131) because it makes allowance for the sampling
error of the adjustment factor c. The accuracy obtained with covariance
relative to that without covariance is estimated by 15,130/7345, or 2.06.
The use of covariance appears to have had about the same effect as
doubling the number of replicates. In making this comparison, we
ignored the effect of the reduction in error d.f. from 36 to 35, because it is
negligible.

As in ordinary regressions, the x variable may be transformed to
another scale if this is likely to produce a more linear relation with y. The
use of log x, for instance, is common in biological work. Two or more
different x variables may be used. The calculations for this case are
described by Snedecor (3.4, chapter 13).

3.88 The Case Where the x Variable is Influenced by the Treatments.

In a covariance analysis the treatment mean $\bar{y}_i$ is adjusted by the amount
$-c(\bar{x}_i - \bar{x})$. The effect of the adjustment is to change each $\bar{y}_i$ to the
value that it would be expected to have if all treatments had the same
x mean. It is in this way that the technique removes the effect of
variations in the $\bar{x}_i$. If, however, the differences among the $\bar{x}_i$ are in part
produced by the treatments, the adjustment removes part of the
treatment effect and its interpretation is changed. Consequently, in the
standard use of covariance, it is important to be sure that the treatments
did not affect the x values. Sometimes this is obvious, as in the
eelworm experiment, because the x values were recorded before the
treatments were applied. A more doubtful case is that of the number of
plants per plot in a field experiment, counted after the application of
the treatments, which may or may not influence plant numbers. An
F-test of the x values is helpful in such cases.

Where the treatments do affect x, a covariance analysis may add
information about the way in which the treatments produced their effects.
In the eelworm experiment, the yields of the oats which were grown on
the plots were obtained. Since eelworms attack oats, it would be
interesting to know whether the effects of the treatments on the oats were
simply a reflection of their effects on the eelworms. This is examined
by a covariance analysis in which the oats yields are the y values, and
the eelworm numbers after harvest are the x values. If the F-test of the
adjusted treatment means still shows significance, the conclusion is that
not all the treatment effect on the oats can be attributed to the reduction
in eelworm numbers. This happened in the present instance, because
some treatments supplied nitrogen to the crop, and therefore acted in
part as fertilizers as well as fumigants. As Bartlett (3.11) has pointed
out, the interpretation of this use of covariance requires care, since a
hidden extrapolation may be involved.

When the x variables are influenced by treatments, the short-cut
method for t-tests by use of the effective residual error and the
approximate F-tests by means of an analysis of (y - cx) should be avoided,
since they may be seriously in error.


3.9 Effects of Errors in the Assumptions Underlying the Analysis of 

Variance 

The assumptions made in the analysis of variance are that treatment 
and environmental effects are additive, and that the experimental errors 
are independently distributed in the normal distribution, with a common 
variance. In practice we can never be sure that these assumptions all 
hold, and often there is good reason to suspect that some are false. The 
consequences of failures in the assumptions and the remedial steps to 
be taken have been summarized recently by Eisenhart (3.12), Cochran 
(3.13), and Bartlett (3.14). Only a few comments will be given here.

As a rule, the failure of an assumption will affect both the significance 
levels and the sensitivity of F- and t-tests. When the experimenter
thinks that he is testing at the 5% level, he may actually be testing at 
the 8% level. Usually, though not invariably, the true significance 
probability is larger than the apparent one; that is, too many significant 
results are obtained. Also, there is usually a loss of sensitivity, in the 
sense that a more powerful test than the analysis of variance F-test 
could be constructed if the correct mathematical model were known. 
There is a corresponding loss of accuracy in the estimates obtained for 
the treatment effects, since these, too, could be made more accurate if the 
correct model were known. 

Although generalization is hazardous, experience suggests that in the 
majority of experiments, at least in the field of biology, these
disturbances are not sufficiently great to invalidate the technique. They do
imply, however, that significance levels and confidence limits must be 
considered approximate rather than exact. For the same reason, the 
inflexible use of say the 5% significance level to divide the effects into 
those that are regarded as “real” and those that are not is hardly jus- 
tifiable. 

The most serious disturbances appear to arise when the experimental 
error variance is not constant over all observations. Sometimes this 
happens, as mentioned previously, because certain treatments are erratic 
in their effects. In such cases, the appropriate error variance for
comparing one pair of treatments might be four times as large as that for
another pair, and the use of the same estimated variance for both
comparisons would lead to t-tests that were completely erroneous. Where
this type of disturbance is suspected, the remedy is to divide the error
s.s. into components each of which is homogeneous, as indicated in
section 3.1)3. 

The same problem may arise because the experimental errors follow a 
distribution that is decidedly skew. In such distributions the error 
variance for a treatment tends to be a function of the mean produced by 
the treatment. If the nature of the functional relationship is known, a 
transformation can be found that will place the data on a scale on which 
the error variance is more nearly constant. This transformation is then 
made on the observations before starting the analysis. The principal 
transformations that have been found useful are discussed by Bartlett 
(3.14); they include logs, square roots, and (for data expressed in frac- 
tions) inverse sines. 

Such transformations may also be useful in cases where treatment and 
environmental effects are not additive. If, for instance, a treatment in- 
creases all observations by 20%, irrespective of the initial level, a change 
to logs will introduce additivity. When transformations are made for 
this purpose, it should be realized that they will also affect the distribu- 
tion of the experimental errors. Fortunately, it often happens that such 
transformations also bring the distribution of errors closer to normality.
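The 20% illustration can be made concrete with a few numbers. The values below are hypothetical, chosen only to show that a proportional effect is unequal on the original scale and constant on the log scale.

```python
import math

# Hypothetical untreated values at three different initial levels.
untreated = [10.0, 50.0, 200.0]
treated = [1.2 * y for y in untreated]     # the treatment adds 20% everywhere

# On the original scale the treatment effect depends on the level ...
raw_effects = [t - u for t, u in zip(treated, untreated)]

# ... while on the log scale it is the same constant, log(1.2), at every level.
log_effects = [math.log10(t) - math.log10(u) for t, u in zip(treated, untreated)]

print([round(e, 1) for e in raw_effects])
print([round(e, 4) for e in log_effects])
```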

Finally, the assumption that the errors are independent from observa- 
tion to observation may be obviously untenable. It is well known that 
crop yields on neighboring plots tend to be positively correlated, and in 
laboratory experiments observations made by the same person at about 
the same time tend to exhibit the same type of correlation. These cor- 
relations might completely vitiate tests of significance. The remedy in 
this case is the proper use of randomization, which, as it were, introduces
independence in the assignment of treatments to the experimental units 
or in the assignment of the order in which observations are made, so that 
the errors may effectively be regarded as independent. For further dis- 
cussion of this question, see Yates (3.15), Fisher (3.3), Bartlett (3.16), 
and Welch (3.17). 
CHAPTER 4


COMPLETELY RANDOMIZED, RANDOMIZED BLOCK, 

AND LATIN SQUARE DESIGNS 

4.1 Completely Randomized Designs 

4.11 Description. The simplest type of layout is that in which
treatments are allotted to the units entirely by chance. More specifically, if
a treatment is to be applied to four units, for example, the randomization
gives every group of four units in the experimental material an equal
probability of receiving the treatment. In addition the units should be
processed in random order at all subsequent stages of the experiment
where this order is likely to affect the results.

This design has several conveniences:

1. Complete flexibility is allowed. Any number of treatments and
of replicates may be used. The number of replications can be varied at
will from treatment to treatment (though such variation is not
recommended without good reason). All the available experimental material
can be utilized, an advantage in small preliminary experiments where
the supply of material is scarce.

2. The statistical analysis is easy even if the numbers of replicates are
not the same for all treatments or if the experimental errors differ from
treatment to treatment.

3. The method of analysis remains simple when the results from some
units or from whole treatments are missing or are rejected. Moreover,
the relative loss of information due to missing data is smaller than with
any other design.

The principal objection to a completely randomized design is on the
grounds of accuracy. Since the randomization is not restricted in any
way to ensure that the units which receive one treatment are similar to
those which receive another treatment, the whole of the variation among
the units enters into the experimental error. For this reason the error
can often be reduced by the use of a different design, unless the units are
highly homogeneous or the experimenter has no information by which
to arrange or handle the units in more homogeneous groups.

Complete randomization seems the obvious procedure for many
laboratory experiments, e.g., in physics, chemistry, or cookery, where a
quantity of material, after thorough mixing, is divided into small sam- 
ples or batches to which the treatments are applied. On the other hand, 
these designs are seldom used in field experiments, the method of ran- 
domized blocks having been found consistently more accurate. 

One fact compensates to some extent for the higher experimental er- 
rors as compared with other designs. For a given number of treatments 
and a given number of experimental units, complete randomization pro- 
vides the maximum number of degrees of freedom for the estimation of 
error. As pointed out in section 2.31, the sensitivity of the experiment 
increases as the number of error degrees of freedom is increased. Con- 
sider, for example, an experiment with 2 treatments in 4 replicates. The 
degrees of freedom for error are 6 under complete randomization as 
against 3 if the method of pairing (section 4.2) is used. From section 
2.31 it appears that the paired experiment must produce about a 14% 
reduction in the error variance in order to offset the additional un- 
reliability in the estimate of error. This point is worth bearing in mind 
with small experiments. 

To summarize, complete randomization may be appropriate (i) where 
the experimental material is homogeneous, (ii) where an appreciable 
fraction of the units is likely to be destroyed or to fail to respond, and 
(iii) in small experiments where the increased accuracy from alternative 
designs does not outweigh the loss of error degrees of freedom. 

4.12 Randomization. If the number of units does not exceed 16, 
tables 15.6 and 15.7 may be used. Suppose that there are 3 treatments, 
of which two have 4 replicates while the third has 8 replicates. Num- 
bers 1-16 are assigned to the units in any convenient order. A random 
permutation is drawn from table 15.7, say 

9, 13, 8, 5, 12, 1, 14, 16, 6, 7, 3, 4, 10, 11, 15, 2 

The first treatment is applied to units 9, 13, 8, 5; the second to units 12, 
1, 14, 16; and the third to the remainder. 

With more than 16 units in the experiment, numbered discs, beans,
or a book of random numbers may be used, as described in chapter 15. 
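Where a computer is at hand, the same allocation can be sketched with a pseudo-random permutation in place of the printed tables. This is only an illustration: the `allocate` helper, the treatment labels, and the seed are our own choices and are not part of the procedure described in chapter 15.

```python
import random

def allocate(permutation, replicates):
    """Give consecutive stretches of a permutation of unit numbers to each treatment."""
    plan, start = {}, 0
    for treatment, r in replicates.items():
        plan[treatment] = permutation[start:start + r]
        start += r
    return plan

replicates = {"first": 4, "second": 4, "third": 8}

# The permutation quoted in the text.
book_permutation = [9, 13, 8, 5, 12, 1, 14, 16, 6, 7, 3, 4, 10, 11, 15, 2]
book_plan = allocate(book_permutation, replicates)

# A fresh permutation of units 1-16 (seed fixed only so that the plan can be repeated).
rng = random.Random(2024)
fresh_permutation = rng.sample(range(1, 17), 16)
fresh_plan = allocate(fresh_permutation, replicates)

print(book_plan)
print(fresh_plan)
```

The same function serves for more than 16 units, since nothing in it depends on the length of the permutation.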

4.13 Statistical Analysis. Table 4.1 shows some of the results of an
experiment on the effects of applications of sulphur in reducing scab
disease of potatoes. The object in applying sulphur is to increase the acidity
of the soil, since scab does not thrive in very acid soil. In addition to
untreated plots which serve as a control, 3 amounts of dressing were
compared: 300, 600, and 1200 lb. per acre. Both a fall and a spring
application of each amount were tested, so that in all there were 7 distinct
treatments. The sulphur was spread by hand on the surface of the soil,
and then disced in to a depth of about 4 inches. The quantity to be
analyzed is the “scab index.” This is, roughly speaking, the percentage
of the surface area of the potato that is infected with scab. It is obtained
by examining 100 potatoes at random from each plot, grading each
potato on a scale from 0 to 100% infected, and taking the average.


TABLE 4.1 Field plan and scab indices for a completely randomized
experiment on potatoes

  F3     0     S6    F12    S6    S12    S3    F6
   9    12     18     10    24     17    30    16

   0    S3    F12     F6    S3      0     0    S6
  10     7      4     10    21     24    29    12

  F3   S12     F6      0    F6    S12    F3   F12
   9     7     18     30    18     16    16     4

  S3     0    S12     S6     0    F12     0    F3
   9    18     17     19    32      5    26     4

Notation: F = fall, S = spring application, 0 = control. The numbers 3, 6, 12
are the amounts of sulphur in 100 lb. per acre.

Results grouped by treatments

              0        F3     S3     F6     S6    F12    S12
           12   30      9     30     16     18     10     17
           10   18      9      7     10     24      4      7
           24   32     16     21     18     12      4     16
           29   26      4      9     18     19      5     17
Totals       181       38     67     62     73     23     57    G = 501
Means       22.6      9.5   16.8   15.5   18.2    5.8   14.2

Analysis of variance

Source of variation    d.f.               s.s.      m.s.      F
Treatments             (t - 1) =  6      972.3     162.0    3.61 *
Error                  (N - t) = 25     1122.9      44.9
Total                  (N - 1) = 31     2095.2

* Denotes significance at the 5% level.
The design was a completely randomized one, with 4 replications of
each sulphur dressing and 8 replications of the control, which received
extra replication in order to obtain a fairly good estimate of the natural
infestation. Incidentally, a randomized blocks layout might have been
superior.

The computations are very simple. Let y denote an observation, $T_i$ a
treatment total, G the grand total, $r_i$ the number of replications of the
ith treatment, $N = \sum r_i$ the total number of observations, and t the
number of treatments.

Step 1. Find the treatment totals and the grand total.

Step 2. The sums of squares are computed as follows.

Correction factor:

$$C = \frac{G^2}{N} = \frac{(501)^2}{32} = 7843.8$$

Total:

$$\sum y^2 - C = (9)^2 + (12)^2 + \cdots + (4)^2 - 7843.8 = 2095.2$$

Treatments:

$$\sum\frac{T_i^2}{r_i} - C = \frac{(181)^2}{8} + \frac{(38)^2 + (67)^2 + \cdots + (57)^2}{4} - 7843.8 = 972.3$$

Error:

$$(\text{Total s.s.}) - (\text{treatments s.s.}) = 2095.2 - 972.3 = 1122.9$$
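Steps 1 and 2 can be followed through on the scab indices of table 4.1. The sketch below assumes the observations as grouped by treatments in that table; the dictionary keys and variable names are ours.

```python
# Scab indices grouped by treatment (table 4.1); "O" is the untreated control.
plots = {
    "O":   [12, 10, 24, 29, 30, 18, 32, 26],
    "F3":  [9, 9, 16, 4],
    "S3":  [30, 7, 21, 9],
    "F6":  [16, 10, 18, 18],
    "S6":  [18, 24, 12, 19],
    "F12": [10, 4, 4, 5],
    "S12": [17, 7, 16, 17],
}

# Step 1: treatment totals and grand total.
totals = {name: sum(ys) for name, ys in plots.items()}
n_obs = sum(len(ys) for ys in plots.values())
grand_total = sum(totals.values())

# Step 2: sums of squares. Unequal replication enters only through the divisors r_i.
correction = grand_total ** 2 / n_obs
total_ss = sum(y * y for ys in plots.values() for y in ys) - correction
treatments_ss = sum(totals[k] ** 2 / len(plots[k]) for k in plots) - correction
error_ss = total_ss - treatments_ss

treatments_df = len(plots) - 1
error_df = n_obs - len(plots)
f_ratio = (treatments_ss / treatments_df) / (error_ss / error_df)

print(totals, grand_total)
print(round(correction, 1), round(total_ss, 1), round(treatments_ss, 1),
      round(error_ss, 1), round(f_ratio, 2))
```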


The analysis of variance is given at the foot of table 4.1. The
F-ratio for treatments is significant at the 5% level. From the treatment
means it appears that all dressings had some beneficial effect, and that
the fall application was more effective than the spring one. There is
little or no evidence that the higher dressings were more effective than
the lowest dressing. If the summary is conducted from this point of
view, we should isolate and test two individual components of the
treatments s.s.: (i) a component measuring the average effect of all dressings,
(ii) a component comparing the fall and spring applications. The
following computations are required.

Average effect of sulphur. The total over all dressings is 320,
representing 24 plots. Since the control total, 181, represents 8 plots, the
comparison is

$$3(181) - 320 = 223$$

The contribution to the sum of squares in the analysis of variance is
(223)²/96, or 518.0. By the rules in section 3.42, the divisor, 96, is
[(9)(8) + 24].

Fall versus spring application. The comparison is

$$(38 + 62 + 23 - 67 - 73 - 57) = -74$$

It is obviously orthogonal to the previous comparison. The
contribution to the sums of squares is (74)²/24, or 228.2. These calculations
lead to the analysis of variance in table 4.2.
The remaining four components of the treatments s.s. must represent
comparisons among the levels of sulphur. The average reduction in scab
due to sulphur is significant at the 1% level, while the superiority of the
fall application is also significant. Differences among the levels show no
sign of significance.


TABLE 4.2 Subdivision of the treatments sum of squares in the
experiment on potato scab

Source of variation              d.f.      s.s.      m.s.       F
Treatments                         6      972.3     162.0     3.61 *
  Control vs. sulphur              1      518.0     518.0    11.54 **
  Fall vs. spring application      1      228.2     228.2     5.08 *
  Comparisons among levels         4      226.1      56.5     1.26
Error                             25     1122.9      44.9

* Denotes significance at the 5% level.

** Denotes significance at the 1% level.
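The two single comparisons and their divisors may be obtained from one general rule: square the comparison and divide by the sum of (coefficient)² times the replication. The sketch below applies it to the treatment totals of table 4.1, taking the treatments s.s. (972.3) and error m.s. (44.9) from the text; the function and variable names are ours.

```python
totals = {"O": 181, "F3": 38, "S3": 67, "F6": 62, "S6": 73, "F12": 23, "S12": 57}
reps = {"O": 8, "F3": 4, "S3": 4, "F6": 4, "S6": 4, "F12": 4, "S12": 4}
treatments_ss, error_ms = 972.3, 44.9

def comparison(coeffs):
    """Return (value, divisor, s.s.) for a comparison among treatment totals."""
    value = sum(coeffs[k] * totals[k] for k in totals)
    divisor = sum(coeffs[k] ** 2 * reps[k] for k in totals)
    return value, divisor, value ** 2 / divisor

# Control against all dressings: 3(control total) minus the six dressing totals.
sulphur = {k: (3 if k == "O" else -1) for k in totals}
# Fall against spring; the control takes no part.
fall_spring = {k: (0 if k == "O" else 1 if k.startswith("F") else -1) for k in totals}

sulphur_value, sulphur_divisor, sulphur_ss = comparison(sulphur)
fs_value, fs_divisor, fs_ss = comparison(fall_spring)

# Orthogonality: the replication-weighted sum of products of coefficients is zero.
cross = sum(reps[k] * sulphur[k] * fall_spring[k] for k in totals)

levels_ss = treatments_ss - sulphur_ss - fs_ss     # remaining 4 d.f.
f_sulphur = sulphur_ss / error_ms
f_fall_spring = fs_ss / error_ms
f_levels = (levels_ss / 4) / error_ms

print(round(sulphur_ss, 1), round(fs_ss, 1), round(levels_ss, 1))
print(round(f_sulphur, 2), round(f_fall_spring, 2), round(f_levels, 2))
```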

The conclusions might be phrased as follows. “The application of 
sulphur produced a significant decrease in the scab index, the averages 
being 22.6 for the untreated plots, 16.4 for plots with the spring applica- 
tion, and 10.2 for plots with the fall application. The fall application 
proved significantly better than the spring application. There was no 
indication that the higher levels of dressing were more effective than the 
lowest level.” It should be remarked that a continuation of this
experiment and other experiments caused these conclusions to be modified,
the higher levels having shown to better effect. 

A more extensive discussion of the analysis of experiments of this type 
is given by Snedecor (4.1). Note that, if certain observations are missing 
or have to be rejected, the method of computation remains exactly the 
same except for the slight complication that different treatments will 
usually have different numbers of replications. 

The practice of calculating a separate standard error for each treat- 
ment is still found in some types of experimentation. Unless the treat- 
ments have different error variances, the use of a pooled error, as given 
by the analysis of variance, is recommended since more sensitive tests of 
significance are obtained. In cases where the experimental error appears 
to vary considerably from treatment to treatment, a test of homogeneity 
of the error variance, due to Bartlett (4.2), may be made. This test is 
also described in Snedecor (4.1, section 10.13). 

It is worth emphasizing that, even if the treatments are randomly as- 
signed to the units, the above methods of analysis do not apply when 
restrictions on the randomization are introduced at a later stage in the 
experiment. Consider a chemical experiment in which samples of liquid 
from the same bottle are treated in four different ways. The samples,
being mutually indistinguishable, are assigned at random to the treat- 
ments. It is, however, practicable to treat only a few samples during 
each work period, and the investigator decides to complete a single 
replication at each session. The result of this additional control is that 
the differences among replicates are eliminated from the experimental 
errors and must equally be eliminated from the estimated errors. The 
analysis should follow the method described in section 4.2. This point
will arise frequently in experiments where many physical operations 
must be carried out. 


4.14 Standard Errors. The estimated standard error of the difference
between two treatment means is

$$s_d = \sqrt{s^2\left(\frac{1}{r_1} + \frac{1}{r_2}\right)}$$

This formula applies when a pooled error is used, $s^2$ being the pooled
error m.s. per unit, and $r_1$ and $r_2$ the numbers of replicates for the two
treatments. If $r_1 = r_2 = r$, the formula reduces to $\sqrt{2s^2/r}$. The degrees
of freedom for t-tests are those in $s^2$.

Example. The average effect of the sulphur has already been tested
by an F-test. We will perform the same test by means of a t-test applied
to the treatment means. Since the control has 8 replications, and the
mean of all sulphur dressings has 24 replications,

$$s_d = \sqrt{(44.9)\left(\tfrac{1}{8} + \tfrac{1}{24}\right)} = 2.736$$

The mean scab index for the control is 22.62, and that for the dressings
is 13.33. Hence

$$t = \frac{22.62 - 13.33}{2.736} = 3.395$$

with 25 d.f. The value of t² is 11.53, in agreement with the F found in
the analysis of variance.
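The agreement between t² and F is exact apart from rounding, as a short computation shows. This sketch uses the pooled error m.s. of 44.9 and unrounded means, so the last digit of t differs slightly from the hand calculation.

```python
import math

error_ms = 44.9                     # pooled error m.s., 25 d.f.
control_total, control_reps = 181, 8
dressing_total, dressing_reps = 320, 24

s_d = math.sqrt(error_ms * (1 / control_reps + 1 / dressing_reps))
difference = control_total / control_reps - dressing_total / dressing_reps
t = difference / s_d

# F for the same comparison from the analysis of variance: (223)^2/96 over the error m.s.
f_from_anova = (223 ** 2 / 96) / error_ms

print(round(s_d, 3), round(t, 3), round(t ** 2, 2), round(f_from_anova, 2))
```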

If the true error variances per unit are considered to be different for
the two treatments, the appropriate formula is

$$s_d = \sqrt{\frac{s_1^2}{r_1} + \frac{s_2^2}{r_2}}$$

where $s_1^2$, $s_2^2$ are the respective error m.s. per unit for the two treatments.
In this case the ratio of the treatment difference to $s_d$ does not follow
the ordinary t-distribution. [The next sentences of the source, which
refer to published tables for this case, are illegible.] For occasions where
these tables are not readily accessible, the following approximation is
suggested. It probably errs slightly on the conservative side, in the sense
that the value of t required for significance may be slightly too high.

Let $n_1$, $n_2$ be the numbers of degrees of freedom in $s_1^2$, $s_2^2$, respectively.
From the ordinary t-table record the significance levels $t_1$, $t_2$
corresponding to $n_1$ and $n_2$ degrees of freedom, respectively. The approximate
significance level for the ratio $d/s_d$ is

$$t' = \frac{w_1 t_1 + w_2 t_2}{w_1 + w_2}$$

where $w_1 = s_1^2/r_1$ and $w_2 = s_2^2/r_2$.

Since t' always lies between the ordinary t values for $n_1$ and $n_2$ degrees
of freedom, this calculation is needed only for those occasional cases
where d is close to the borderline of significance. Further, when
$n_1 = n_2 = n$, t' is the ordinary t value for n degrees of freedom.

Example. This experiment is a case where one might suspect that the
error variance would not be homogeneous. If a treatment is very
successful in reducing scab, the scab indices for that treatment will all be
close to zero; hence their variance must be small. With a treatment
such as the control the scab index has a greater possible range of
variation. Consequently, we might expect the error variance for a treatment
to depend on the mean produced by the treatment. With only 3 d.f.
for the estimated error variance per treatment, we cannot hope to detect
small differences in the true error variances. Perhaps the most important
comparison is that between the variance for the control and the average
variance for the sulphur dressings. The error m.s. will be found to be
70.0 for the control (with 7 d.f.) and 35.2 for the dressings (with 6 × 3
or 18 d.f.). The F-ratio, 1.99, falls just short of the 10% level of
significance.

Consequently, the conclusions will be on a sounder basis if we ascribe
different true error variances to the control and the dressings. For a
t-test of the average effect of sulphur we now take

$$s_d = \sqrt{\frac{70.0}{8} + \frac{35.2}{24}} = \sqrt{8.75 + 1.47} = 3.197$$

The value of t is 9.29/3.197, or 2.906. To test this at the 1% level by the
approximate test above we have

$$n_1 = 7;\quad n_2 = 18;\quad t_1 = 3.499;\quad t_2 = 2.878;\quad w_1 = 8.75;\quad w_2 = 1.47$$

Hence the value required for significance is

$$t' = \frac{(8.75)(3.499) + (1.47)(2.878)}{10.22} = 3.410$$

The average effect of sulphur now falls short of the 1% significance level.
It remains significant at the 5% level. For comparisons among the
different sulphur dressings we would use ordinary t-tests, where $s^2$ is
taken as 35.2 with 18 d.f. The effect will be to enhance the significance of
comparisons among the dressings, though the conclusions quoted
previously are not altered.
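
The approximate test with unequal error variances can be sketched in Python as follows. The function name is hypothetical; the tabular t values 3.499 and 2.878 are those quoted in the example rather than values computed by the code.

```python
import math

def approx_critical_t(s1_sq, r1, t1, s2_sq, r2, t2):
    """Weighted critical value t' for a difference with unequal error variances.

    s1_sq, s2_sq: error mean squares per unit for the two treatments
    r1, r2:       numbers of replications behind each mean
    t1, t2:       tabular t values for the d.f. of s1_sq and s2_sq
    """
    w1 = s1_sq / r1
    w2 = s2_sq / r2
    s_d = math.sqrt(w1 + w2)
    t_prime = (w1 * t1 + w2 * t2) / (w1 + w2)
    return s_d, t_prime

# Control: m.s. 70.0 (7 d.f.), 8 replications; dressings: 35.2 (18 d.f.), 24.
s_d, t_prime = approx_critical_t(70.0, 8, 3.499, 35.2, 24, 2.878)
t_observed = (22.62 - 13.33) / s_d
significant_at_1_percent = t_observed > t_prime

print(f"s_d = {s_d:.3f}, t = {t_observed:.3f}, t' = {t_prime:.3f}")
```

Since the weights are carried unrounded, the last digit may differ slightly from the hand calculation.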

As mentioned in section 3.9, it is usually better to handle this type of
experiment, where the error variance is thought to be a function of the
mean, by converting the original data to a scale on which the error
variances will be more nearly constant. On the supposition that the error
variance may be proportional to the mean, the square roots of the scab
indices might have been analyzed. As a matter of interest this analysis
is shown in table 4.3; square roots were recorded to only one decimal
place.


TABLE 4.3 Analysis of variance of the square roots of the scab indices

Source of variation              d.f.     s.s.     m.s.        F
Treatments                         6     18.22     3.04      4.19
  Control vs. sulphur              1      8.17     8.17     11.27
  Fall vs. spring application      1      4.59     4.59      6.33
  Comparisons among levels         4      5.46     1.36      1.88
Error                             25     18.12     0.725
  Error for controls               7      6.27     0.896
  Error for dressings             18     11.85     0.658
Total                             31     36.34


The error m.s. for the controls is now closer to that for the dressings,
the F-ratio being 1.36 against 1.99 in the original analysis. Further, the
F-ratios for most components of the treatments s.s. are higher than in the
original analysis, suggesting an increase in the sensitivity of the analysis.

4.2 Single Grouping: Randomized Blocks

4.21 Description. The essence of this design is that the experimental
material is divided into groups, each of which constitutes a single trial
or replication. At all stages of the experiment the object is to keep the
experimental errors within each group as small as is practicable. Thus,
when the units are assigned to the successive groups, all units which go
in the same group should be closely comparable. Similarly, during the
course of the experiment, a uniform technique should be employed for
all units in the same group. Any changes in technique or in other
conditions that may affect the results should be made between groups.

This division into replications need be recognized only at those stages 
in the conduct of the experiment where the division may help to reduce 
experimental errors. In agricultural field experiments the division is 
made at the start when the plots are marked out in the field. Since 
neighboring plots are known to be more alike in fertility than plots some 
distance removed, each replicate consists of a compact group of plots 
and is made approximately square in shape if feasible. Cultivations 
designed to keep the land clean of weeds will usually be carried out with- 
out regard to the replications, because it is not believed that results are 
affected by the order in which plots are cultivated. Similarly, the plots 
will generally be harvested in whatever order is most convenient. If, 
however, harvesting must be spread over a number of days, it is well to 
harvest the plots replication by replication, in case rainfall or other 
factors should produce changes in the weight of the crop from day to 
day. In other types of experimentation no real distinction might be
made between replicates until relatively late in the experiment.

The principal advantages of randomized blocks are as follows. 

1. By means of the grouping, more accurate results are usually
obtained than with completely randomized designs.

2. Any number of treatments and any number of replicates may be
included. Ordinarily all treatments have the same number of replicates.
If extra replication is desired for some treatments, each of these may be
applied to two units within every group. This device provides twice the
standard number of replicates for the treatments in question, at the
expense of some increase in the size of the group, which now contains
more than a single replication. Similarly a treatment may be applied
three or four times in a group.

3. The statistical analysis is straightforward. Mishaps which
necessitate the omission of a complete group or of the entire data from one
or more treatments do not introduce any complication in the analysis.
When data from some individual units are lacking, the “missing-plot”
technique developed by Yates enables the available results to be fully
utilized. Some extra computational labor is, however, involved, and if
the gaps are numerous the design is less convenient in this respect than
complete randomization.

4. If the experimental error variance is larger for some treatments
than for others, an unbiased error for testing any specific combination
of the treatment means can still be obtained.

No design is more frequently used than randomized blocks. Certainly 
if a satisfactory degree of precision is reached, there is little need to search 
for alternative designs. 

It is worth noting that the replication means provide unbiased com- 
parisons of the differences among replicates. Occasionally, these dif- 
ferences measure some property of the experimental material that is of 
interest. The variance-ratio test of the replications against the error 
m.s. requires some care in its interpretation. In a greenhouse experi- 
ment, for instance, a different type of soil might be used in each replica- 
tion. Significant differences among replications might be due either to 
differences in soil type or to differences in the positions of the replications 
within the greenhouse. 

4.22 Randomization. When the units have been grouped, the
treatments are assigned at random to the units within each group. A new
randomization is made for every group. Unless the number of
treatments exceeds 16, tables 15.6 and 15.7 are convenient for this operation.
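
As an illustration of the rule that every group receives its own independent randomization, the following Python sketch draws a plan with a pseudo-random number generator. This swaps in a software generator for the printed permutation tables mentioned above; the function name and the `seed` argument are assumptions made for the example.

```python
import random

def randomize_blocks(treatments, n_blocks, seed=None):
    """Return one random ordering of the treatments for each block."""
    rng = random.Random(seed)
    plan = []
    for _ in range(n_blocks):
        order = list(treatments)
        rng.shuffle(order)  # a fresh, independent randomization per block
        plan.append(order)
    return plan

# Five potash dressings (lb. K2O per acre) in three randomized blocks.
plan = randomize_blocks([36, 54, 72, 108, 144], n_blocks=3, seed=1)
for block_number, order in enumerate(plan, start=1):
    print(f"Block {block_number}: {order}")
```

Position k in each printed list stands for the kth unit (plot) of that block.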

4.23 Statistical Analysis. The example comes from an experiment
carried out by the North Carolina Agricultural Experiment Station at
Rocky Mount, N. C., in 1944. The experiment tested the effects of 5
levels of application of potash, supplying respectively 36, 54, 72, 108,
and 144 lb. K2O per acre, on the yield and properties of cotton. The
measure chosen for analysis is the Pressley strength index. This is
found by measuring the breaking strength of a bundle of fibers of a given
cross-sectional area. A single sample of cotton was taken from each plot,
and 4 determinations were made on each sample. The figures in table
4.4 are the means of these 4 determinations.* The experiment was arranged
in 3 randomized blocks of 5 plots each.

To make the computing instructions more general we suppose that 
there are t treatments and r replicates, and that y denotes a typical ob- 
servation. 

* Since the machine which measures the index is calibrated in arbitrary units, no
dimensions are ascribed to the data in table 4.4. The index can be converted
approximately into pounds per square inch by means of a regression formula.

Step 1. Find the treatment totals ($T_i$), the replicate totals ($R_j$), and
the grand total (G).


TABLE 4.4 Strength index of cotton in a randomized blocks experiment

Treatments                     Replications                    Coded
(pounds K2O per acre)        1        2        3     Totals   dressing
         36                7.62     8.00     7.93     23.55      13
         54                8.14     8.15     7.87     24.16       8
         72                7.76     7.73     7.74     23.23       3
        108                7.17     7.57     7.80     22.54      -7
        144                7.46     7.68     7.21     22.35     -17
      Totals              38.15    39.13    38.55    115.83


Analysis of variance

Source of variation        d.f.                s.s.      m.s.       F
Replications          (r - 1)        =  2     0.0971
Treatments            (t - 1)        =  4     0.7324    0.1831    4.19
Error                 (r - 1)(t - 1) =  8     0.3495    0.0437
Total                 rt - 1         = 14     1.1790


Subdivision of the treatments s.s.

                           d.f.      s.s.      m.s.        F
Linear response              1      0.5663    0.5663    12.96**
Deviations                   3      0.1661    0.0554     1.27


Step 2. The sums of squares are obtained as follows.

Correction factor:

$$C = \frac{G^2}{tr} = \frac{(115.83)^2}{15} = 894.4393$$

Total:

$$\sum y^2 - C = (7.62)^2 + (8.14)^2 + \cdots + (7.21)^2 - 894.4393 = 1.1790$$

Replications:

$$\frac{\sum R_j^2}{t} - C = \frac{(38.15)^2 + (39.13)^2 + (38.55)^2}{5} - 894.4393 = 0.0971$$

Treatments:

$$\frac{\sum T_i^2}{r} - C = \frac{(23.55)^2 + (24.16)^2 + \cdots + (22.35)^2}{3} - 894.4393 = 0.7324$$

Error: (total s.s.) - (replications s.s.) - (treatments s.s.)

$$= 1.1790 - 0.0971 - 0.7324 = 0.3495$$
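
Steps 1 and 2 translate directly into code. The following Python sketch applies them to the strength indices of table 4.4; the function name and the dictionary keys are illustrative and do not come from the text.

```python
def randomized_blocks_anova(y):
    """Sums of squares for randomized blocks; y[i][j] = treatment i, replicate j."""
    t, r = len(y), len(y[0])
    # Step 1: treatment totals, replicate totals, and grand total.
    treatment_totals = [sum(row) for row in y]
    replicate_totals = [sum(y[i][j] for i in range(t)) for j in range(r)]
    grand_total = sum(treatment_totals)
    # Step 2: correction factor and sums of squares.
    correction = grand_total ** 2 / (t * r)
    total_ss = sum(v * v for row in y for v in row) - correction
    replication_ss = sum(R * R for R in replicate_totals) / t - correction
    treatment_ss = sum(T * T for T in treatment_totals) / r - correction
    error_ss = total_ss - replication_ss - treatment_ss
    treatment_ms = treatment_ss / (t - 1)
    error_ms = error_ss / ((r - 1) * (t - 1))
    return {
        "correction": correction,
        "total_ss": total_ss,
        "replication_ss": replication_ss,
        "treatment_ss": treatment_ss,
        "error_ss": error_ss,
        "treatment_ms": treatment_ms,
        "error_ms": error_ms,
        "F": treatment_ms / error_ms,
    }

# Rows: 36, 54, 72, 108, 144 lb. K2O per acre; columns: replications 1 to 3.
cotton = [
    [7.62, 8.00, 7.93],
    [8.14, 8.15, 7.87],
    [7.76, 7.73, 7.74],
    [7.17, 7.57, 7.80],
    [7.46, 7.68, 7.21],
]
anova = randomized_blocks_anova(cotton)
for name, value in anova.items():
    print(f"{name:15s} {value:10.4f}")
```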


The F-ratio for treatments (table 4.4) is 0.1831/0.0437, or 4.19, which
is significant at the 5% level with 4 and 8 d.f. The treatment totals
suggest that the strength decreases with increasing applications of
potash, though there is a hint of a maximum in strength for the 54-lb.
application. Accordingly it is worth while to examine the shape of the
response curve. We will first fit a linear regression on the amount of
dressing. Note that successive increments of dressing are not equal, the
last two increments being twice as large as the first two. This is a
fairly common practice in cases where the effectiveness of an increase in
dressing is expected to be smaller at the higher levels of application.

Although the regression can be calculated by the standard formula,
a simple coding of the dressings lightens the work. If we place the
lowest dressing (36 lb.) at zero and take 18 lb. as 1 unit on the scale, the
dressings may be coded as 0, 1, 2, 4, and 6, respectively. A further
device is to subtract a common amount from these values so that their
mean is zero. Since the mean of the coded dressings is 13/5, this is the
amount that must be subtracted. To avoid fractions, we multiply all
values by 5 and subtract 13 from each product. The resulting coded
dressings are shown in table 4.4; the signs have been changed so that the
regression coefficient will be positive.

The regression coefficient is now obtained as

$$\frac{(13)(23.55) + (8)(24.16) + (3)(23.23) - (7)(22.54) - (17)(22.35)}{(3)\left[(13)^2 + (8)^2 + (3)^2 + (7)^2 + (17)^2\right]} = \frac{31.39}{1740} = 0.0180$$

The factor (3) in the denominator is inserted to convert the treatment
totals to means. The regression coefficient represents the average
decrease in strength for a unit increase on the coded scale. Since 5 units
on the coded scale correspond to 18 lb. K2O, the decrease in strength for
each additional 18 lb. K2O is estimated as 0.090.

The contribution of the regression to the sum of squares in the analysis
of variance is (by rule 1, section 3.42)

$$\frac{(31.39)^2}{1740} = 0.5663$$

as shown in the analysis of variance. The contribution from the linear
regression is significant at the 1% level. Since the mean square for
deviations from the regression, 0.0554, is only slightly above the error
m.s., there seems no point in investigating a quadratic regression, though
the reader may care to verify that its contribution is small.
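
The coding of the dressings and the linear component can be reproduced with the Python sketch below. The variable names are illustrative; the treatments s.s. (0.7324) and error m.s. (0.0437) are taken from the analysis of variance rather than recomputed.

```python
doses = [36, 54, 72, 108, 144]                 # lb. K2O per acre
totals = [23.55, 24.16, 23.23, 22.54, 22.35]   # treatment totals
r = 3                                          # replicates per treatment

# Code the doses as 0, 1, 2, 4, 6; multiply by 5, subtract 13, change sign.
units = [(d - 36) // 18 for d in doses]
coded = [-(5 * u - 13) for u in units]

numerator = sum(c * T for c, T in zip(coded, totals))
denominator = r * sum(c * c for c in coded)
b = numerator / denominator            # decrease per unit on the coded scale
decline_per_18_lb = 5 * b              # 5 coded units correspond to 18 lb. K2O

linear_ss = numerator ** 2 / denominator
treatment_ss, error_ms = 0.7324, 0.0437
deviations_ss = treatment_ss - linear_ss
F_linear = linear_ss / error_ms
F_deviations = (deviations_ss / 3) / error_ms

print(coded, round(b, 4), round(linear_ss, 4), round(deviations_ss, 4))
```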

The conclusion from the data is that increased dressings of potash
produce a weaker fiber, the strength index declining by 0.090 for each
18-lb. increment in K2O. As an exercise we will obtain confidence limits
for this rate of decline. We first require the estimated standard error for
the 0.090 figure. This can be found by expressing the figure as a linear
function of the treatment means and applying rule 5a of section 3.51.
We have

$$0.090 = \frac{5}{580}\left[13\bar{y}_1 + 8\bar{y}_2 + 3\bar{y}_3 - 7\bar{y}_4 - 17\bar{y}_5\right]$$

so that the estimated standard error is

$$\frac{5}{580}\sqrt{\frac{0.0437}{3}}\,\sqrt{(13)^2 + (8)^2 + \cdots + (17)^2} = \frac{5\sqrt{0.0437}}{\sqrt{3}\,\sqrt{580}} = 0.0251$$

Alternatively, this value could be derived by noting that the square root
of the F-value for the linear regression, i.e., $\sqrt{12.96}$, or 3.6, is the
ratio of 0.090 to its estimated standard error. Thus the latter must be
0.09/3.6, or 0.0250. Finally, for the 80% confidence limits, we have
(0.090 ± 0.025 × 1.397), or (0.090 ± 0.035), where 1.397 is the value of
t for a probability 0.20 and 8 d.f.
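
A brief Python sketch of the standard error and the 80% limits follows. The tabular value 1.397 is copied from the text, not computed, and the variable names are illustrative.

```python
import math

coded = [13, 8, 3, -7, -17]   # coded dressings from table 4.4
error_ms = 0.0437             # error mean square, 8 d.f.
r = 3                         # replicates behind each treatment mean
t_80 = 1.397                  # tabular t for probability 0.20 and 8 d.f.

sum_sq = sum(c * c for c in coded)  # 580
# Variance of a linear function sum(L_i * mean_i) is (s^2 / r) * sum(L_i^2),
# with multipliers L_i = 5 * c_i / 580.
standard_error = (5 / sum_sq) * math.sqrt(error_ms / r * sum_sq)

estimate = 0.090
half_width = standard_error * t_80
lower, upper = estimate - half_width, estimate + half_width

print(f"s.e. = {standard_error:.4f}, limits = ({lower:.3f}, {upper:.3f})")
```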


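4.24 Standard Errors. The estimated standard error of the difference
between two treatment means is

$$s_d = \sqrt{\frac{2s^2}{r}}$$

where $s^2$ is the error mean square and r is the number of replicates.
If some treatments receive extra replication, the general formula is

$$s_d = \sqrt{s^2\left(\frac{1}{r_1} + \frac{1}{r_2}\right)}$$

where $r_1$, $r_2$ are the numbers of replicates of the two treatments
being compared.

In experiments where the error appears to be heterogeneous, a separate
error may be obtained for any pair or for any group of treatments. For
this a new randomized blocks analysis is carried out on those treatments
which belong to the group under consideration.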
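
For illustration, the standard error of a difference can be wrapped in a small Python helper. The function name is hypothetical, and the error m.s. of 0.0437 with 3 replicates is borrowed from the cotton example of section 4.23; the second call assumes a purely hypothetical treatment applied twice in every block.

```python
import math

def se_difference(error_ms, reps_1, reps_2=None):
    """Standard error of the difference between two treatment means."""
    if reps_2 is None:
        reps_2 = reps_1  # equal replication: reduces to sqrt(2 * s^2 / r)
    return math.sqrt(error_ms * (1.0 / reps_1 + 1.0 / reps_2))

se_equal = se_difference(0.0437, 3)
# Hypothetical case: one treatment doubled within every block (6 vs. 3 units).
se_extra = se_difference(0.0437, 6, 3)

print(f"equal replication: {se_equal:.4f}, extra replication: {se_extra:.4f}")
```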


4.25 Missing Data. The method of analysis when part of the data is
missing is described by Yates (4.4). With a single missing unit, the first
step is to calculate a value for the unit by means of the formula

$$y = \frac{rB + tT - G}{(r - 1)(t - 1)} \qquad (4.1)$$

where B is the total of the remaining units in the block where the missing
unit appears, T is the total of the yields of this treatment in the other
blocks, and G is the grand total; r and t are the numbers of replicates and
treatments respectively. The analysis of variance is then carried out as
usual except that 1 d.f. is subtracted from the total s.s. and from the
error s.s.

The standard error of the difference between the mean of the
treatment with a missing value and the mean of any other treatment is

$$\sqrt{s^2\left[\frac{2}{r} + \frac{t}{r(r - 1)(t - 1)}\right]} \qquad (4.2)$$


When there are several missing values, for units a, b, c, d, ..., we first
guess values by inspection for all units except a. Formula (4.1) is then
used to find an approximation for a. With this approximation and the
values previously assumed for c, d, ..., we again use formula (4.1) to
insert an approximation for b. After a complete cycle of these operations,
a second approximation is found for a and so on until the new
approximations are not materially different from those found previously. The
analysis of variance is then completed; for each missing unit, 1 d.f. is
subtracted from the total and error s.s.

Suppose that in the previous example two observations had been
missing and that the data had appeared as follows.

                        Replications
Pounds K2O         1         2         3       Totals
    36             a       8.00      7.93      15.93
    54           8.14      8.15      7.87      24.16
    72           7.76        b       7.74      15.50
   108           7.17      7.57      7.80      22.54
   144           7.46      7.68      7.21      22.35
  Totals        30.53     31.40     38.55     100.48

Since differences between replications are not pronounced, we might
take as a trial value for a the mean of 8.00 and 7.93, or 7.96. For
estimating $b_1$, the first trial value of b, by formula (4.1) we now have

B = 31.40; T = 15.50; G = 100.48 + 7.96 = 108.44

so that

$$b_1 = \frac{(3)(31.40) + (5)(15.50) - 108.44}{8} = 7.91$$

For estimating $a_2$, the second trial value of a, we now have

B = 30.53; T = 15.93; G = 100.48 + 7.91 = 108.39

so that

$$a_2 = \frac{(3)(30.53) + (5)(15.93) - 108.39}{8} = 7.86$$

Taking $a_2$ as 7.86, we find that the next trial value of b is 7.92. This is so
close to the previous value that we may stop. Thus a = 7.86, b = 7.92.
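
The cycle of approximations can be automated. The Python sketch below repeats formula (4.1) for each missing unit in turn until the estimates settle. It makes two assumptions not found in the text: the starting guesses are the means of the observed values for the same treatment (rather than guesses by inspection), and no rounding is done between cycles, so the final figures may differ from the hand calculation in the last decimal place. The function name and the use of `None` for a missing unit are illustrative.

```python
def fill_missing(y, tol=1e-6, max_cycles=100):
    """Estimate missing units in randomized blocks by repeated use of (4.1).

    y[i][j] is the observation on treatment i in block j; None marks a gap.
    Every treatment is assumed to have at least one observed value.
    """
    t, r = len(y), len(y[0])
    filled = [row[:] for row in y]
    missing = [(i, j) for i in range(t) for j in range(r) if y[i][j] is None]
    # Starting guesses: mean of the observed values for the same treatment.
    for i, j in missing:
        present = [v for v in y[i] if v is not None]
        filled[i][j] = sum(present) / len(present)
    for _ in range(max_cycles):
        largest_change = 0.0
        for i, j in missing:
            B = sum(filled[k][j] for k in range(t) if k != i)  # rest of block
            T = sum(filled[i][k] for k in range(r) if k != j)  # rest of treatment
            G = sum(map(sum, filled)) - filled[i][j]           # all other units
            new = (r * B + t * T - G) / ((r - 1) * (t - 1))
            largest_change = max(largest_change, abs(new - filled[i][j]))
            filled[i][j] = new
        if largest_change < tol:
            break
    return filled

cotton_gaps = [
    [None, 8.00, 7.93],
    [8.14, 8.15, 7.87],
    [7.76, None, 7.74],
    [7.17, 7.57, 7.80],
    [7.46, 7.68, 7.21],
]
filled = fill_missing(cotton_gaps)
a, b = filled[0][0], filled[2][1]
print(f"a = {a:.3f}, b = {b:.3f}")
```

With these data the unrounded estimates agree with the values 7.86 and 7.92 found above to within about 0.01.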

The analysis of variance is now computed with these values inserted.
There will be 12 d.f. in the total s.s., and 6 d.f. in the error s.s. The error
m.s. is 0.0491.

To obtain a standard error for the comparison of two treatments, A
and B, we assign an “effective” number of replicates to each treatment.
Any replication of treatment A is counted as 1 when both A and B are
present in the replicate, as 1/2 when A is present but B is not, and as 0
when A is missing. The same rule is applied in scoring B. Suppose that
in the example we are comparing the means of the 36- and 72-lb.
dressings. The effective replication for each mean is scored as 3/2. The
standard error of their difference is taken as $\sqrt{s^2(2/3 + 2/3)}$, or
$\sqrt{1.333s^2}$. If we are comparing the mean of the 36-lb. dressing with
that of the 54-lb. dressing, which has no missing values, the score for the
first mean is 2, and that for the second 5/2, so that the standard error of
their difference is $\sqrt{s^2(1/2 + 2/5)}$, or $\sqrt{0.9s^2}$.

This useful approximation is due to Yates; the exact formulae are
laborious to compute. The approximation usually gives a standard
error that is slightly too high. In the example the correct values for the
standard errors of the two differences considered above are 1.069s and
0.937s, as compared with the approximate values of 1.155s and 0.949s,
respectively.
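
The scoring rule for effective replication is easy to mechanize. In the Python sketch below each treatment is described by a list of flags, one per replicate, showing whether its observation is present; the function names and this representation are illustrative choices.

```python
import math

def effective_reps(present_a, present_b):
    """Effective number of replicates of A when compared with B."""
    score = 0.0
    for a_here, b_here in zip(present_a, present_b):
        if a_here and b_here:
            score += 1.0   # both treatments present in the replicate
        elif a_here:
            score += 0.5   # A present, B missing
        # A missing: the replicate contributes nothing
    return score

def approx_se_factor(present_a, present_b):
    """Multiplier of s in the approximate standard error of the difference."""
    r_a = effective_reps(present_a, present_b)
    r_b = effective_reps(present_b, present_a)
    return math.sqrt(1.0 / r_a + 1.0 / r_b)

# Presence of each dressing in replications 1, 2, 3 of the example.
present_36 = [False, True, True]   # unit a is missing
present_54 = [True, True, True]
present_72 = [True, False, True]   # unit b is missing

factor_36_72 = approx_se_factor(present_36, present_72)
factor_36_54 = approx_se_factor(present_36, present_54)
print(f"{factor_36_72:.3f}s and {factor_36_54:.3f}s")
```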

As a result of the disturbance introduced by the missing units, the 
treatments m.s. is slightly too large; however, the variance-ratio test is 
unlikely to be much in error unless a substantial proportion of the units 
are missing. Yates (4.4) gives the method for obtaining an exact test. 


4.26 Estimation of Efficiency. If $E_b$ and $E_e$ are the block and error
m.s. and $n_b$, $n_t$, and $n_e$ the block, treatment, and error degrees of
freedom,

$$E_{c.r.} = \frac{n_b E_b + (n_t + n_e)E_e}{n_b + n_t + n_e} \qquad (4.3)$$

is an estimate of the error variance of a completely randomized design
with the same experimental material. Comparison of $E_{c.r.}$ and $E_e$,
taking account of the change in numbers of error degrees of freedom
(section 2.31), provides an estimate of the increase in accuracy which
results from the grouping into replicates.
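
As a sketch of how formula (4.3) might be applied, the Python lines below use the cotton experiment of section 4.23 (replications s.s. 0.0971 with 2 d.f., error m.s. 0.0437 with 8 d.f.). The function name is hypothetical, and the adjustment for the change in error degrees of freedom mentioned above is deliberately left out.

```python
def completely_randomized_error(E_b, E_e, n_b, n_t, n_e):
    """Estimated error variance of a completely randomized design, formula (4.3)."""
    return (n_b * E_b + (n_t + n_e) * E_e) / (n_b + n_t + n_e)

E_b = 0.0971 / 2   # block (replications) mean square
E_e = 0.0437       # error mean square
E_cr = completely_randomized_error(E_b, E_e, n_b=2, n_t=4, n_e=8)

# Crude ratio of error variances, before any adjustment for degrees of freedom.
ratio = E_cr / E_e
print(f"E_cr = {E_cr:.4f}, ratio = {ratio:.3f}")
```

A ratio only slightly above 1, as here, would suggest that the grouping into replicates removed little variation in this particular experiment.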

The result in (4.3) may be proved in various ways. Since there is no 
complete discussion of results of this type in the literature, one method 
of proof will be sketched; the details, which are a matter of algebraic 
manipulation, will not be given completely. This proof uses the
properties of randomization. It is not the easiest proof, but requires very few
assumptions.

Let the experiment have t treatments and r replications, and let $e_i$
be the experimental error of the observation on the ith unit, and $\tau_j$ the
effect of the jth treatment. No assumption is made about the nature
or distribution of the errors. For this reason it is not necessary to
introduce any specific symbol for the effect of the replication, since
any type of effect can be represented by appropriate choice of the e's.
The treatment effect and the error are assumed to be additive; that is,
if the randomization happens to put the jth treatment on the ith unit,
their joint effect is $(\tau_j + e_i)$. Throughout the randomization the $e_i$
are regarded as a set of fixed numbers, each associated with a specific unit.

Without loss of generality, we may assume that the total of all
observations is zero, and that the totals of the e's and the $\tau$'s are both
zero. Let

$$S = \sum_{i=1}^{rt} e_i^2, \qquad T = \sum_{j=1}^{t} \tau_j^2$$

Further, for all possible randomizations of the randomized blocks design
the replications will of course remain the same; consequently, for any
given batch of data the replication totals remain unchanged. Let R
denote the sum of squares of these totals, divided by t.

The main part of the proof consists in working out the average values
of the mean squares in the analysis of variance, taken over all possible
randomizations of each type of design. The results come out as shown
in table 4.5.


TABLE 4.5 Average values of mean squares taken over the randomization sets

Completely randomized
                   d.f.               m.s.
Treatments       (t - 1)              rT/(t - 1) + S/(rt - 1)
Error            t(r - 1)             S/(rt - 1)

Randomized blocks
                   d.f.               m.s.
Replications     (r - 1)              R/(r - 1)
Treatments       (t - 1)              rT/(t - 1) + (S - R)/[r(t - 1)]
Error            (r - 1)(t - 1)       (S - R)/[r(t - 1)]


This analysis leads at once to the result. Since

$$n_b = (r - 1); \qquad n_t + n_e = r(t - 1);$$

it is seen from table 4.5 that for randomized blocks the average value
of $n_b E_b$ is R, while that of $(n_t + n_e)E_e$ is (S - R). It follows that the
average value of

$$\frac{n_b E_b + (n_t + n_e)E_e}{n_b + n_t + n_e} \quad \text{is} \quad \frac{R + S - R}{(rt - 1)} = \frac{S}{(rt - 1)}$$

But this is the average of the error m.s. for a completely randomized
design on the same data.
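
The randomized-blocks entries of table 4.5 can be checked numerically on a very small case by enumerating every possible randomization. The Python sketch below does this exactly, using fractions, for a made-up design with t = 3 treatments and r = 2 replications; the treatment effects and unit errors are arbitrary illustrative numbers chosen to total zero, and the function name is hypothetical.

```python
from fractions import Fraction
from itertools import permutations, product

def mean_squares(y):
    """Randomized blocks mean squares; y[j][i] is treatment i in replication j."""
    r, t = len(y), len(y[0])
    C = Fraction(sum(map(sum, y))) ** 2 / (r * t)
    total = sum(v * v for row in y for v in row) - C
    reps = sum(sum(row) ** 2 for row in y) / Fraction(t) - C
    treats = sum(sum(row[i] for row in y) ** 2 for i in range(t)) / Fraction(r) - C
    error = total - reps - treats
    return reps / (r - 1), treats / (t - 1), error / ((r - 1) * (t - 1))

tau = [2, -1, -1]                  # treatment effects (total zero)
e = [[3, -1, 1], [-2, 2, -3]]      # fixed unit errors e[j][unit] (total zero)
r, t = len(e), len(tau)

S = sum(v * v for row in e for v in row)
T = sum(v * v for v in tau)
R = Fraction(sum(sum(row) ** 2 for row in e), t)

results = []
for assignment in product(permutations(range(t)), repeat=r):
    y = [[0] * t for _ in range(r)]
    for j, perm in enumerate(assignment):
        for unit, treatment in enumerate(perm):
            # Additive model: treatment effect plus the error of the unit used.
            y[j][treatment] = tau[treatment] + e[j][unit]
    results.append(mean_squares(y))

n = len(results)
avg_reps = sum(m[0] for m in results) / n
avg_treat = sum(m[1] for m in results) / n
avg_error = sum(m[2] for m in results) / n

# Average of formula (4.3) with n_b = r - 1 and n_t + n_e = r(t - 1).
avg_E_cr = ((r - 1) * avg_reps + r * (t - 1) * avg_error) / (r * t - 1)
print(avg_reps, avg_treat, avg_error, avg_E_cr)
```

An exhaustive check of this kind is feasible only for very small designs, since the number of randomizations is $(t!)^r$.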

As an illustration of the details, consider the treatments m.s. for
randomized blocks, which is probably the hardest term. The square of the
first treatment total is

$$T_1^2 = (r\tau_1 + e_1 + e_2 + \cdots + e_r)^2$$

When we average over all possible randomizations, there is no
contribution from terms in $\tau e_i$, since the treatment appears equally often
on all units, and the total of the e's is zero. The average value of the
contribution from the r terms in $e^2$ is the sum of the squares of all the e's,
divided by t, or S/t, since the treatment appears on any specific unit
in a fraction 1/t of all randomizations.

The contribution from terms in $e_i e_j$ may be written

$$e_1(e_2 + e_3 + \cdots + e_r) + e_2(e_1 + e_3 + \cdots + e_r) + \cdots + e_r(e_1 + e_2 + \cdots + e_{r-1}) \qquad (4.4)$$

The randomization is restricted so that every e comes from a different
replication. If the subscript denotes the replicate, the mean value of
$e_i e_j$ is $R_i R_j/t^2$, where $R_i$ is the replication total of the e's.
Consequently, the mean value of (4.4) is

$$\frac{1}{t^2}\left[R_1(R_2 + R_3 + \cdots + R_r) + R_2(R_1 + R_3 + \cdots + R_r) + \cdots + R_r(R_1 + R_2 + \cdots + R_{r-1})\right] = -\frac{1}{t^2}\left(R_1^2 + R_2^2 + \cdots + R_r^2\right) = -\frac{R}{t}$$

since the total of the $R_i$ is zero, and since R will be recognized as the sum
of squares of the $R_i$, divided by t. Hence

$$E(T_1^2) = r^2\tau_1^2 + \frac{S}{t} - \frac{R}{t}$$

If this expression is summed for all t treatments and divided by r(t - 1)
to give the treatments m.s., the result in table 4.5 follows.

4.3 Double Grouping: Latin Squares

4.31 Description. In the latin square the treatments are grouped into
replicates in two different ways. Examples of latin squares are shown
for different numbers of treatments in plan 4.1 (p. 119). It will be noted
that every row and every column of any square is a complete replication.
The effect of the double grouping is to eliminate from the errors all
differences among rows and equally all differences among columns. Thus
the latin square provides more opportunity than randomized blocks for
the reduction of errors by skillful planning.

The experimental material should be arranged and the experiment 
conducted so that the differences among rows and columns represent 
major sources of variation. Some examples of the uses of latin squares
in various fields of research may indicate the utility of the design. In 
field experiments the plots are usually laid out in a square formation, 
so that soil fertility and other variations in two directions are controlled. 
Variations along and across the greenhouse bench may be similarly 
handled in greenhouse experiments. Occasionally the latin square is 
advantageous even when the plots form a continuous line. In this case 
the rows may be compact blocks of land while the columns specify the 
order within each block. If the yield gradient is suspected to be in the 
same direction all along the line, blocks and order in blocks together re- 
move the effects of the gradient more thoroughly than a single control. 
An example of this type is shown below. 


TABLE 4.6 A latin square with the plots in one continuous line

Seven treatments A, B, C, D, E, F, G
(Experimental limitations force the plot units to go at right angles to gradient.)
Rep. I II III IV V VI VII

Main and Tippett (4.5) describe how 4 × 4 latin squares were
employed in experiments on the weaving of cotton cloth. The purpose of a
series of experiments was to investigate the effect of the sizing treatment
applied to the warp. The criterion was the number of breaks in the warp
during weaving. Four warps, each with a different sizing treatment,
were woven simultaneously on 4 different looms, which could be
supervised by a single weaver. Then each warp was moved to a different loom
of the set so that after 4 periods every sizing treatment had been tested
on all 4 looms.
If A, B, C, D represent the 4 warps, the latin square used was as
follows.

                     Looms
Periods        1     2     3     4
   I           A     D     B     C
   II          D     C     A     B
   III         C     B     D     A
   IV          B     A     C     D


This arrangement eliminates constant differences among the looms, 
which were found to be large, and also differences among the 4 periods
of weaving. No “period” differences were apparent — a result which 
might be anticipated, the authors suggest, because humidity and warp 
tension were controlled throughout the experiment. 

Frequently, particularly in industry, an experiment requires a series
of operations each of which may introduce variability into the final
results. In such cases the latin square may be useful in a preliminary
investigation of the sources of variation. For example, in the preparation
of an explosive mixture used in primers, variation may occur either in
the mixing of the ingredients of the explosive or in the process of
charging. One experiment of this type involved 4 mixing-blending teams and
4 charging operators. On each day, the product of each team was sent
to a different charging operator, the arrangement being changed daily
according to the following 4 × 4 latin square (letters W, X, Y, and Z
represent the mixing-blending teams).


                  Charging operators
                 1     2     3     4
Monday           W     Z     X     Y
Tuesday          X     W     Y     Z
Wednesday        Y     X     Z     W
Thursday         Z     Y     W     X


The latin square analysis of variance enables us to isolate consistent dif- 
ferences amongst the teams and consistent differences amongst the 
chargers, as well as day-to-day variations. 
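
The defining property of these plans, that every treatment occurs exactly once in each row and once in each column, can be verified mechanically. The Python sketch below checks the weaving and primer squares shown above; the function name and the use of strings for rows are illustrative.

```python
def is_latin_square(square):
    """True if each symbol occurs exactly once in every row and every column."""
    n = len(square)
    symbols = set(square[0])
    if len(symbols) != n:
        return False
    # Every row must have n entries and contain each symbol once.
    if not all(len(row) == n and set(row) == symbols for row in square):
        return False
    # Every column must likewise contain each symbol once.
    return all({row[c] for row in square} == symbols for c in range(n))

# Rows are periods (weaving) or days (primers); columns are looms or operators.
weaving = ["ADBC", "DCAB", "CBDA", "BACD"]
primers = ["WZXY", "XWYZ", "YXZW", "ZYWX"]
not_latin = ["ABCD", "ABCD", "CDAB", "DCBA"]   # deliberately invalid example

print(is_latin_square(weaving), is_latin_square(primers), is_latin_square(not_latin))
```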

To quote a more complex example, a 12 × 12 latin square was used
by Chen, Bliss, and Robbins (4.6) to estimate the toxicities of 12 poisons
when applied to cats. A single replication of this experiment occupied
a complete day, and the experiment was repeated for 12 days. Each
drug was injected into the femoral vein of a cat at 1 cc. per minute until
the cat died, and the measurement recorded was the dose required to
cause death. An observer could administer only two of the drugs at a
given time. Consequently, three observers were used, each treating
two cats in the morning and two in the afternoon. The rows of the latin
square were used to eliminate systematic differences between observers
and between the morning and afternoon injections. The columns
represent days. The experimental plan was as follows (letters represent
drugs).


                                           Day
Time          Observer    1   2   3   4   5   6   7   8   9  10  11  12
10:30 a.m.    I           I   J   B   L   H   G   F   K   D   E   A   C
                          K   G   J   H   I   B   L   C   E   F   D   A
              II          B   L   G   C   D   J   K   E   H   A   F   I
                          E   D   F   G   J   K   A   L   C   I   B   H
              III         C   K   A   B   F   L   I   D   G   H   J   E
                          F   H   K   E   G   C   D   B   A   L   I   J
2:30 p.m.     I           J   C   E   K   A   I   H   F   B   G   L   D
                          D   F   I   A   L   E   C   G   J   B   H   K
              II          A   B   C   D   E   F   G   H   I   J   K   L
                          H   E   L   J   C   A   B   I   K   D   G   F
              III         G   I   D   F   K   H   J   A   L   C   E   B
                          L   A   H   I   B   D   E   J   F   K   C   G

During the course of the experiment every drug was given equally 
often by each observer and equally often in the morning and afternoon. 
The arrangement is a good example of the way in which a single grouping 
(rows) can control simultaneously more than one potential source of 
variation. All three major variables — day, time of day, and observer — 
would have inflated the experimental error if left uncontrolled. 

4.32 Number of Replications. The chief restriction on the utility of 
the latin square is that the number of replicates equals the number of 
treatments; if the latter is considerable, the number of replications re- 
quired becomes impractical. Squares larger than 12 X 12 are seldom 
used, while the most common range is from the 5 X 5 to the 8 X 8 
square. Latin squares also suffer to some extent from the same
disadvantage as randomized blocks in that the experimental error per unit
is likely to increase with the size of square.

The small squares provide only a few degrees of freedom for the
estimation of error: none with the 2 × 2, two with the 3 × 3, and six
with the 4 × 4. This fact precludes the use of single 2 × 2 squares,
while the 3 × 3 and 4 × 4 squares must produce a substantial reduction
in error over randomized blocks or complete randomization to
counterbalance the loss of degrees of freedom. More than one square may,
however, be included in the same experiment. Three 3 × 3 squares
(9 replicates) furnish 10 error d.f., while two 4 × 4 squares give 15 d.f.,
provided that in each case the “squares × treatments” interaction can
be pooled with error. A considerable number of 2 × 2 squares would be
required, since the error degrees of freedom are 1 less than the number of
squares.
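
These counts of error degrees of freedom can be reproduced by a short calculation. The Python sketch below assumes, as one reading of the paragraph above, that rows and columns are fitted separately within each square and that the squares × treatments interaction is pooled with error; the function name is hypothetical.

```python
def latin_square_error_df(t, n_squares=1):
    """Error d.f. for n_squares latin squares of size t x t, analyzed together."""
    total = n_squares * t * t - 1
    squares = n_squares - 1
    rows = n_squares * (t - 1)       # rows within squares
    columns = n_squares * (t - 1)    # columns within squares
    treatments = t - 1
    return total - squares - rows - columns - treatments

for size, count in [(2, 1), (3, 1), (4, 1), (3, 3), (4, 2), (2, 5)]:
    print(f"{count} square(s) of {size} x {size}: "
          f"{latin_square_error_df(size, count)} error d.f.")
```

For a single square the expression reduces to (t - 1)(t - 2).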

4.33 Randomization. A complete representation of the squares from
4 × 4 to 6 × 6 and sample squares up to the 12 × 12 is given by Fisher
and Yates (4.3). For squares up to the 6 × 6, the randomization
procedure given in this reference selects (with a minimum of labor) a square
at random from all latin squares of a given size. For a discussion of the
theoretical basis of the randomization, see Yates (4.7).

Examples of latin squares are shown in plan 4.1 (p. 119). The method
of randomization for plan 4.1 is as follows.

3 × 3. Arrange the columns at random and the last 2 rows at random.

4 × 4. Select at random one of the 4 squares. Arrange at random all
columns and the last 3 rows. It is equally good, though not strictly
necessary, to randomize all rows and columns.

5 × 5 and higher squares. Arrange all rows, columns, and treatments
independently at random.

For 3X3 and 4X4 squares, this procedure selects one square at 
random from all possible squares. For 5 X 5 and larger squares, some 
types of squares have no chance of being selected if plan 4.1 is used. 
Unless latin squares are used very frequently, however, the randomiza- 
tion sets are sufficiently large for experimental plans. 
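
The rule for 5 × 5 and higher squares can be sketched in Python as follows. Because plan 4.1 is not reproduced here, the sketch starts from a cyclic square, which is an assumption made only for illustration; the function name and the `seed` argument are likewise hypothetical, and a pseudo-random generator stands in for random number tables.

```python
import random

def randomize_latin_square(square, seed=None):
    """Permute rows, columns, and treatment labels independently at random."""
    rng = random.Random(seed)
    n = len(square)
    row_order = rng.sample(range(n), n)
    col_order = rng.sample(range(n), n)
    symbols = sorted(set(square[0]))
    relabel = dict(zip(symbols, rng.sample(symbols, n)))
    return [[relabel[square[i][j]] for j in col_order] for i in row_order]

# Hypothetical starting plan: the 5 x 5 cyclic square on treatments A to E.
start = [["ABCDE"[(i + j) % 5] for j in range(5)] for i in range(5)]
plan = randomize_latin_square(start, seed=7)
for row in plan:
    print(" ".join(row))
```

Permuting rows, columns, and labels never destroys the latin property, so the result is always a valid square, although, as the text notes, not every possible square can be reached from a given starting plan.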

4.34 Statistical Analysis. Several experiments have demonstrated 
that people find it difficult to select, by personal judgment, unbiased 
samples even from relatively small populations that can be thoroughly 
inspected before selection. In this experiment, each population con- 
sisted of a small area of wheat, containing about 80 shoots, the shoots 
being slightly over 2 feet high. There were 12 samplers, all experienced 
in studying the growth of wheat. Each sampler inspected each area and 
measured the heights of 8 shoots whose heights were to give a represent- 
ative sample of the shoot heights in the area. The quantity that will be 
analyzed is the difference between the mean height of the 8 selected 
shoots and the true mean height in the corresponding area, in other 
words the sampler's error. 

The samplers were divided into two groups of six, of which we will 
consider only the first group. The samplers represent the experimental 
treatments. There were 6 areas, each sampled by all 6 samplers in the 
group. These areas, which serve as the replications, may be taken as the 
columns of the 6 × 6 latin square. The rows of the square prescribed 
the order in which each man sampled the 6 areas (see table 4.7). This 
feature served two purposes. It helped to prevent the men from getting 
in each other's way, since no two men worked on the same area at the 
same time, and it permitted an examination of the effects of the order of 
sampling on the errors. 

TABLE 4.7 Sampler’s errors in shoot heights (centimeters) 

6 × 6 LATIN SQUARE 

                                   Areas
Order       1         2         3         4         5         6      Totals
I        F +3.5    B +4.2    A +6.7    D +6.6    C +4.1    E +3.8     +28.9
II       B +8.9    F +1.9    D +5.8    A +4.5    E +2.4    C +5.8     +29.3
III      C +9.6    E +3.7    F -2.7    B +3.7    D +6.0    A +7.0     +27.3
IV       D +10.5   C +10.2   B +4.6    E +3.7    A +5.1    F +3.8     +37.9
V        E +3.1    A +7.2    C +4.0    F -3.3    B +3.5    D +5.0     +19.5
VI       A +5.9    D +7.6    E -0.7    C +3.0    F +4.0    B +8.6     +28.4
Totals    +41.5     +34.8     +17.7     +18.2     +25.1     +34.0    +171.3 = G

Totals for samplers 

    A        B        C        D        E        F
  +36.4    +33.5    +36.7    +41.5    +16.0    +7.2

Since only 3 of the 36 errors in table 4.7 are negative, there is evidence 
of a rather consistent tendency towards overestimation of the shoot 
heights. For simplicity, the observations will be referred to as measures 
of bias, though of course they contain a component due to chance 
fluctuations. There is also an indication, on casual inspection of the 
table, that the bias varies from one sampler to another. This point will 
be tested in the analysis of variance. The computations are as follows. 
First find the totals for each row, column, and latin letter and the grand 
total, G, shown in table 4.7. The following sums of squares are then 
calculated. 

Correction factor: $C = G^2/r^2 = (171.3)^2/36 = 815.10$

Total: $(3.5)^2 + (4.2)^2 + \cdots + (8.6)^2 - C = 1144.73 - 815.10 = 329.63$

Rows: $\tfrac{1}{6}[(28.9)^2 + \cdots + (28.4)^2] - C = 843.70 - 815.10 = 28.60$

Columns: $\tfrac{1}{6}[(41.5)^2 + \cdots + (34.0)^2] - C = 893.97 - 815.10 = 78.87$

Treatments: $\tfrac{1}{6}[(36.4)^2 + \cdots + (7.2)^2] - C = 970.70 - 815.10 = 155.60$

The divisor 6 is replaced by r for an r × r square. 

These sums of squares are entered in the analysis of variance. The 
error s.s. is found by subtraction. 


TABLE 4.8 Analysis of variance for a 6 × 6 latin square 

Source of variation             d.f.                    s.s.      m.s.      F
Rows (order of sampling)        (r - 1) = 5            28.60     5.720    1.72
Columns (areas)                 (r - 1) = 5            78.87    15.774    4.74
Treatments (samplers)           (r - 1) = 5           155.60    31.120    9.35
Error                           (r - 1)(r - 2) = 20    66.56     3.328
Total                           (r² - 1) = 35         329.63


To obtain the mean squares, each sum of squares is divided by the 
corresponding number of degrees of freedom. The F value for samplers 
is 31.120/3.328, or 9.35, well beyond the 1% level, which is 4.10 for 
5 and 20 d.f. This shows that the extent of the overestimation varies 
from one sampler to another. The means for the samplers are given 
below. 


    A        B        C        D        E        F
  +6.07    +5.58    +6.12    +6.92    +2.67    +1.20

The estimated standard error of each mean is $\sqrt{s^2/r}$, where $s^2$ is the 
error m.s., or in this case $\sqrt{3.328/6} = 0.745$. For testing the dif- 
ference between a pair of means, the standard error is $\sqrt{2}\,(0.745) = 
1.053$. Since the 5% t value for 20 d.f. is 2.086, the difference between 
two means must be at least (2.086)(1.053), or 2.20, in order to attain 
significance at this level. It appears that the samplers fall into two sets, 
A, B, C, D and E, F, the biases being significantly smaller for E and F 
than for the others. 
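
As a check on the arithmetic, the analysis of variance and the least significant difference can be reproduced with a short Python sketch. The data are the entries of table 4.7, with sampler letters as implied by the row, column, and sampler totals; the 5% t value is simply copied from the text rather than computed, and all variable names are illustrative.

```python
from math import sqrt

# Table 4.7: (sampler, error in cm); rows are orders I-VI, columns are areas 1-6.
square = [
    [("F", 3.5), ("B", 4.2), ("A", 6.7), ("D", 6.6), ("C", 4.1), ("E", 3.8)],
    [("B", 8.9), ("F", 1.9), ("D", 5.8), ("A", 4.5), ("E", 2.4), ("C", 5.8)],
    [("C", 9.6), ("E", 3.7), ("F", -2.7), ("B", 3.7), ("D", 6.0), ("A", 7.0)],
    [("D", 10.5), ("C", 10.2), ("B", 4.6), ("E", 3.7), ("A", 5.1), ("F", 3.8)],
    [("E", 3.1), ("A", 7.2), ("C", 4.0), ("F", -3.3), ("B", 3.5), ("D", 5.0)],
    [("A", 5.9), ("D", 7.6), ("E", -0.7), ("C", 3.0), ("F", 4.0), ("B", 8.6)],
]
r = len(square)
values = [y for row in square for _, y in row]
G = sum(values)
C = G ** 2 / r ** 2  # correction factor

row_totals = [sum(y for _, y in row) for row in square]
col_totals = [sum(square[i][j][1] for i in range(r)) for j in range(r)]
trt_totals = {}
for row in square:
    for letter, y in row:
        trt_totals[letter] = trt_totals.get(letter, 0.0) + y


def ss(totals):
    """Sum of squares for a classification whose totals each contain r values."""
    return sum(t ** 2 for t in totals) / r - C


ss_total = sum(y ** 2 for y in values) - C
ss_rows, ss_cols, ss_trt = ss(row_totals), ss(col_totals), ss(trt_totals.values())
ss_error = ss_total - ss_rows - ss_cols - ss_trt  # found by subtraction
ms_error = ss_error / ((r - 1) * (r - 2))
F_trt = (ss_trt / (r - 1)) / ms_error

se_diff = sqrt(2 * ms_error / r)  # s.e. of a difference between two means
t_05 = 2.086                      # 5% t value for 20 d.f., as quoted in the text
lsd = t_05 * se_diff

print(round(ss_error, 2), round(ms_error, 3), round(F_trt, 2), round(lsd, 2))
```

Comparing each pair of sampler means with `lsd` gives the grouping described above.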

Some further information about the nature of the biases can be ob- 
tained from an examination of the row and column totals. From table 
4.8 the F value for rows is seen to be 1.72, which is below the 5% value, 
2.71. No clear effect of the order in which the areas were sampled has 
therefore been established; further, the row totals (table 4.7) do not sug- 
gest any consistent trend. 

On the other hand, the F value for the columns m.s., 4.74, is significant 
at the 1% level, so that the bias varies from area to area. Previous 
observations had indicated that the bias might depend on the mean shoot 
height of the area. The column totals are ranked below in order of in- 
creasing true mean shoot height. 



           True mean        Column
          shoot height       total
Area      (centimeters)   (centimeters)
  1           59.0            +41.5
  2           60.2            +34.8
  6           72.3            +34.0
  4           74.5            +18.2
  5           76.0            +25.1
  3           76.4            +17.7


Apparently on areas where the shoots are not so high the positive bias 
is greater than on areas with relatively high shoots. The result is not 
surprising, since the sampler had to stoop in order to do his work. The 
linear regression of the column totals on the true mean heights contrib- 
utes 59.88 to the sum of squares for columns, 78.87. The mean square 
for deviations from this regression (4 d.f.) is only 4.75, which is not sig- 
nificantly greater than the error m.s. The inference is that the large 
value of the mean square for columns is due mainly to the negative 
correlation between the bias and the mean height. For further discussion 
of this experiment, see (4.8). 

The analysis of two 4 × 4 squares is discussed in (4.9). A numerical 
example with five 3 × 3 squares will be found in (4.10). It should be 
noted that with several latin squares the interaction of treatments with 
squares may be tested. If there is no reason to expect that this inter- 
action is real, the corresponding sum of squares may be combined with 
error. 


4.35 Standard Errors. The formula $\sqrt{2s^2/r}$ gives the estimated standard 
error of the difference between two treatment means, where $s^2$ is the 
mean square per unit and r the number of replicates. In cases where 
heterogeneity of errors is suspected, the error cannot be subdivided with 
the same ease as in randomized blocks. It is possible to remove from the 
error s.s. the contribution of any one of the treatments, Yates (4.11). For 
each yield $y_i$ of the treatment, calculate the quantity 

$$d_i = R_i + C_i - r y_i$$

where $R_i$ and $C_i$ are the corresponding row and column totals, and r is 
the number of treatments. The sum of squares of deviations of the 
quantities $d_i$, divided by r(r - 2), is the contribution of this treatment 
to the error s.s. and has (r - 1) degrees of freedom. For illustration, 
suppose that we wish to calculate the contribution of sampler F to the 
error in section 4.34. From table 4.7 the first d value is 

$$d_1 = (+28.9) + (+41.5) - 6(+3.5) = +49.4$$

and the remaining values will be found to be +52.7, +61.2, +57.5, 
+29.5, +49.1, with a total +299.4. The sum of squares of deviations 
from the mean is 610.34. Divided by 24, this gives 25.43, with 5 d.f., 
for the contribution of sampler F to the error s.s. Consequently the 
error s.s. (table 4.8) may be divided as shown below. 



                       d.f.      s.s.      m.s.
Total error             20      66.56     3.328
Contribution of F        5      25.43     5.086
Remainder               15      41.13     2.742


The error m.s. for the suspected treatment may be compared with 
the mean square for the remaining (r — l)(r — 3) degrees of freedom, 
to test whether the treatment shows abnormally high variation. In 
this example the variance ratio is 1.85, with 5 and 15 d.f. If it is concluded 
that the treatment is unusually variable, the mean square with 
(r — 1) (r — 3) degrees of freedom is used as error for the other treat- 
ments. It should be stressed that the ordinary variance-ratio tables can 
be used in this test only if the decision to make the test was taken before 
examining the individual results. If the test is made merely because one 
treatment looks anomalous on inspection, different significance levels 
that take account of this fact are needed, as discussed in section 3.53. 
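
The subdivision of the error for sampler F can be retraced with the following Python sketch. The row and column totals and the six yields of F are read from table 4.7, and the total error s.s. (66.56 with 20 d.f.) is taken from table 4.8; the variable names are illustrative only.

```python
# Row (order) and column (area) totals from table 4.7.
row_totals = [28.9, 29.3, 27.3, 37.9, 19.5, 28.4]
col_totals = [41.5, 34.8, 17.7, 18.2, 25.1, 34.0]

# (row index, column index, yield) for the six plots of sampler F, in column order.
f_plots = [(0, 0, 3.5), (1, 1, 1.9), (2, 2, -2.7), (4, 3, -3.3), (5, 4, 4.0), (3, 5, 3.8)]

r = 6
d = [row_totals[i] + col_totals[j] - r * y for i, j, y in f_plots]
mean_d = sum(d) / r
ss_dev = sum((x - mean_d) ** 2 for x in d)

contribution = ss_dev / (r * (r - 2))  # contribution of F to the error s.s., (r - 1) d.f.

error_ss, error_df = 66.56, 20         # total error line of table 4.8
remainder_ss = error_ss - contribution
remainder_df = error_df - (r - 1)      # (r - 1)(r - 3) = 15

ratio = (contribution / (r - 1)) / (remainder_ss / remainder_df)
print([round(x, 1) for x in d], round(contribution, 2), round(ratio, 2))
```

The same steps apply to any other sampler once its six plots are listed in `f_plots`.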


4.36 Missing Values. Except for changes in the formulae, the pro- 
cedure is similar to that for randomized blocks (section 4.25). A single 
missing value is substituted by means of the formula, reference (4.4) 

$$\frac{r(R + C + T) - 2G}{(r - 1)(r - 2)} \qquad (4.5)$$

where R, C, T, and G are respectively the totals of the row, column, and 
treatment which contain the missing value and the grand total. The 
estimated standard error of the difference between the corresponding 
treatment mean and the mean of a treatment with no missing values is 

$$\sqrt{s^2\left(\frac{2}{r} + \frac{1}{(r - 1)(r - 2)}\right)}$$



When several values are missing, repeated application is made of for- 
mula (4.5) as described in reference (4.4). When values have been in- 
serted for all missing data, the usual analysis of variance is calculated. 
One degree of freedom is subtracted from the error d.f . for each missing 
value. 
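
Formula (4.5) and the accompanying standard error translate directly into code. In the sketch below the "missing" value is purely hypothetical: the entry for sampler F in order I, area 1 of table 4.7 (3.5) is treated as lost so that the formula has totals to work on. The function names are introduced here for illustration.

```python
from math import sqrt


def missing_value(r, R, C, T, G):
    """Formula (4.5): R, C, T, G are totals of the observed values only."""
    return (r * (R + C + T) - 2 * G) / ((r - 1) * (r - 2))


def se_diff_with_missing(s2, r):
    """S.e. of (mean of treatment with one missing value) - (mean of a complete treatment)."""
    return sqrt(s2 * (2 / r + 1 / ((r - 1) * (r - 2))))


# Hypothetical case: pretend the value 3.5 (sampler F, order I, area 1) was lost.
lost = 3.5
estimate = missing_value(
    r=6,
    R=28.9 - lost,   # row total without the lost value
    C=41.5 - lost,   # column total without the lost value
    T=7.2 - lost,    # treatment total without the lost value
    G=171.3 - lost,  # grand total without the lost value
)
print(round(estimate, 2))
```

After the estimate is inserted, the analysis of variance is recomputed and the error d.f. reduced by one, as described above.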

The exact formula for the standard error of a treatment mean is com- 
plex. Yates (4.4) proposed a useful approximate rule, which shows the 
number of replicates to be assigned to any treatment mean for a com- 
parison with another treatment mean. Each observation on one treat- 
ment is given one replication if the other treatment is present in both 
the corresponding row and column. The replication is 2/3 when the 
other treatment is missing in the row or column and is 1/3 when the other 
treatment is missing in both the row and column. When the treatment 
itself is missing, the replication is 0. Although complicated at first sight, 
the rule is not difficult to apply and is reasonably accurate. Consider for 
instance the following 6 × 6 square with 3 units missing (two B's and 
one E) as indicated by the parentheses ( ). 

     B      E      C      F      D      A
     F      D     (B)     E      A      C
     C      A      E      D      B      F
    (E)     C      A     (B)     F      D
     A      F      D      C      E      B
     D      B      F      A      C      E

Taking the yields of B and E in the order of the rows, we ascribe the 
following numbers of replicates. 

B: 2/3 + 0 + 1 + 0 + 1 + 1 = 3 2/3 = 11/3

E: 1 + 1/3 + 2/3 + 0 + 1 + 1 = 4

Hence the standard error of the difference between the treatment means 
is $\sqrt{s^2(3/11 + 1/4)}$. For the difference between A and B, the reader may 
verify that the standard error is $\sqrt{s^2(3/14 + 1/4)}$. Notice that the ef- 
fective replication for a treatment may change from one comparison to 
another. 
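
The counting rule is mechanical enough to be written as a small function. The sketch below applies it to the 6 × 6 square of this section, with the three missing units given as zero-based (row, column) positions; `effective_replication` is a name chosen for this illustration.

```python
from fractions import Fraction

# The 6 x 6 square of this section, one string per row.
square = ["BECFDA", "FDBEAC", "CAEDBF", "ECABFD", "AFDCEB", "DBFACE"]
# Missing units as zero-based (row, column): B in row 2, E and B in row 4.
missing = {(1, 2), (3, 0), (3, 3)}

WEIGHT = {0: Fraction(1), 1: Fraction(2, 3), 2: Fraction(1, 3)}


def effective_replication(square, missing, trt, other):
    """Approximate number of replicates of `trt` for a comparison with `other`."""
    r = len(square)
    total = Fraction(0)
    for i in range(r):
        j = square[i].index(trt)
        if (i, j) in missing:
            continue  # the observation itself is missing: replication 0
        # Is `other` missing in the same row? In the same column?
        row_absent = (i, square[i].index(other)) in missing
        k = next(m for m in range(r) if square[m][j] == other)
        col_absent = (k, j) in missing
        total += WEIGHT[row_absent + col_absent]
    return total


rep_B = effective_replication(square, missing, "B", "E")
rep_E = effective_replication(square, missing, "E", "B")
variance_factor = 1 / rep_B + 1 / rep_E  # multiplies s^2 under the square root
print(rep_B, rep_E, variance_factor)
```

Calling the function with `("A", "B")` and `("B", "A")` gives the replications needed for the A versus B comparison left to the reader above.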

The analysis required when a single row, column, or treatment is miss- 
ing is described by Yates (4.11). When more than one row, column, or 
treatment is missing, consult Yates and Hale (4.12). These methods 
are also presented by DeLury (4.13). 

4.37 Estimation of Efficiency. The effectiveness of either the row or 
the column grouping may be tested from the results of latin squares. 
The expression 

$$\frac{n_r E_r + (n_t + n_e) E_e}{n_r + n_t + n_e}$$

is an estimate of the error m.s. which would have been obtained if the 
row grouping had not been used, i.e., if the design had been randomized 
blocks with the columns as blocks. In the formula, $E_r$ and $E_e$ are the 
mean squares for rows and error, respectively, and $n_r$, $n_t$, $n_e$ are the num- 
bers of degrees of freedom for rows, treatments, and error. The in- 
creased number of degrees of freedom with randomized blocks is taken 
into account by the method of section 2.31. The effectiveness of the 
column grouping is tested similarly. 

A 5 × 5 experiment on potatoes will serve as an illustration. The 
mean squares for the yields of potatoes were as follows. 

                d.f.      m.s.
Rows              4       62.6
Columns           4      104.7
Treatments        4      272.0
Error            12       26.2


If the experiment had been conducted in randomized blocks with the 
columns as blocks, the estimated error m.s. would be 

$$\frac{4(62.6) + 16(26.2)}{20} = 33.5$$

The degrees of freedom available for error would increase from 12 to 16. 
To take account of the advantage of the additional degrees of freedom 
(section 2.31), we reduce 33.5 by the multiplier (13)(19)/(17)(15), or 
0.969. This gives 32.5 as a comparable mean square for randomized 
blocks. The row grouping increased the information by an estimated 
25%. Of course a considerable number of these comparisons would be 
needed for drawing general conclusions. 
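
The efficiency calculation for the potato experiment may be sketched as follows. The general form of the degrees-of-freedom multiplier, (n1 + 1)(n2 + 3) / [(n2 + 1)(n1 + 3)], is an assumption inferred from the numerical multiplier used in this example, since section 2.31 is not reproduced here; the function names are illustrative.

```python
def rb_error_ms(n_r, n_t, n_e, E_r, E_e):
    """Estimated error m.s. had the row grouping not been used."""
    return (n_r * E_r + (n_t + n_e) * E_e) / (n_r + n_t + n_e)


def df_multiplier(n1, n2):
    """Assumed form of the section 2.31 adjustment.

    n1 = error d.f. of the latin square, n2 = error d.f. of randomized blocks.
    """
    return (n1 + 1) * (n2 + 3) / ((n2 + 1) * (n1 + 3))


E_r, E_e = 62.6, 26.2    # mean squares for rows and error
n_r, n_t, n_e = 4, 4, 12  # d.f. for rows, treatments, error

unadjusted = rb_error_ms(n_r, n_t, n_e, E_r, E_e)
multiplier = df_multiplier(n_e, n_r + n_e)  # 12 d.f. versus 16 d.f.
adjusted = unadjusted * multiplier
gain = adjusted / E_e - 1                   # relative gain from the row grouping

print(round(unadjusted, 2), round(multiplier, 3), round(adjusted, 2), round(gain, 2))
```

Carried without intermediate rounding, the gain comes out a little below the rounded figure quoted in the text, which is to be expected from the rounding of 33.5 and 32.5.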


4.4 Cross-over Designs 

4.41 Description. In dairy husbandry and biological assay a design 
has been used which closely resembles the latin square but may have 
some advantages when the number of treatments is small. In the sim- 
plest case where there are two treatments A and B, the units are first 
grouped into pairs as if a randomized block design were to be used. 
Suppose that from previous knowledge one unit in each pair is expected 
to give a higher response than the other and that the difference in favor 
of the superior unit is expected to be about the same in all pairs. It will 
clearly be advisable to ensure that each treatment is applied to the 
“better” member in half the replicates and to the “poorer” member in 
the other half. The pairs or replicates, which must be even in number, 
are divided at random into two equal sets, the first set to receive treat- 
ment A on the superior member of each pair, the second to receive B. 

The experiment discussed in chapter 1 provides an example. The 
data (shown in table 4.10) are the times required to compute the sums 
of squares of 27 observations on each of two machines whose speeds it was 
desired to compare. Ten sums of squares were calculated, making 10 
replications of the experiment. The cross-over design was used because 
it was thought that the second computation of a sum of squares might 
be faster than the first, so that for a fair comparison each machine should 
be used first in five of the ten replications. The randomization assigned 
machine A to be used first in replications 1, 3, 6, 8, and 9. 

The degrees of freedom in the analysis of variance are as follows. 

                                    d.f.
Columns (pairs or replicates)         9
Rows                                  1
Treatments                            1
Error                                 8
Total                                19

Note that only 1 d.f. is assigned to rows. That is, the arrangement re- 
moves from error only the average difference between the 2 rows. In so 
far as the difference is not constant but varies from one replication to 
another, this variation enters into the experimental error. 

In some experiments it is known that this difference will vary, and it 
is possible to estimate in advance (at least roughly) whether the dif- 
ference will be large or small for a given replicate. Suppose that the dif- 
ference is expected to decrease steadily from replicate 1 to replicate 10. 
With this knowledge a more accurate design is obtained by the use of 
five 2 × 2 latin squares, as shown below. 

TABLE 4.9 Five 2 × 2 latin squares for 2 treatments in 10 replicates 

                                 Square
Rows            I         II        III        IV         V
             1    2     3    4     5    6     7    8     9   10
Better       A    B     A    B     B    A     A    B     A    B
Poorer       B    A     B    A     A    B     B    A     B    A

In this arrangement we remove from error 5 d.f. for rows, one in each 
square. Thus not only the average difference is removed, but also the 
variation in the difference from square to square. The combined analysis 
of variance of the 5 squares is set out as follows. 


                             d.f.
Squares                        4
Columns within squares         5
Rows within squares            5
Treatments                     1
Error                          4
Total                         19

The 9 d.f. for squares and columns within squares are exactly the same 
as the 9 d.f. for columns in the cross-over design. Owing to the more 
complete elimination of the row effects, there are only 4 d.f. for error 
instead of 8 with the cross-over plan. 

To summarize, the cross-over design is particularly appropriate when 
the difference between the rows is substantially the same in all replicates, 
for in this case the whole of the real difference between rows is con- 
centrated in the single degree of freedom and the error variance is no 
larger than that of the latin squares. Even if the difference between rows 
is known to be variable, the cross-over design may be preferable in small 
experiments where few degrees of freedom are available for error. 

In dairy husbandry the cross-over may be used to compare the effects 
of two feeding rations on the amount and quality of milk produced by 
the cow. Since cows vary greatly in their milk production, each ration 
is tested on every cow by feeding it during either the first or the second 
half of the period of lactation, so that each cow gives a separate replicate. 
The milk yield of a cow declines sharply from the first to the second half 
of its period, so that the first half is always “better,” in the sense above. 
Whether a cross-over is superior to a set of latin squares depends on 
circumstances. The rate of decline is not constant from cow to cow; it is 
greater in general for high-yielding than for low-yielding cows. Thus if 
previous production records are available, the cows may be divided into 
pairs on the basis of yielding ability, each pair being made a separate 
2 × 2 latin square. This plan is likely to give a smaller error than the 
cross-over, though sometimes not sufficiently smaller to counterbalance 
the loss of degrees of freedom. 

The design can be used with any number of treatments, subject to the 
restriction that the number of replicates must be a multiple of the num- 
ber of treatments. With three treatments, for example, a plan can be 
drawn up from the 3 cycles ABC, BCA, CAB, where the order of the 
letters denotes the row to which each treatment is applied. Each cycle is 
allotted at random to one-third of the replicates. For higher numbers of 
treatments a design is constructed in the same way from the columns of 
any latin square. When the number of treatments exceeds four, how- 
ever, the degrees of freedom for error are sufficiently large so that a set 
of latin squares is usually preferable. 
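
The allotment of cycles to replicates can be illustrated with a brief Python sketch. It builds the cycles by rotating the treatment letters and assigns each cycle at random to an equal share of the replicates; the function name `crossover_plan` and the fixed seeds are choices made for this illustration only.

```python
import random


def crossover_plan(treatments, n_reps, rng=random):
    """Return one cycle per replicate; position i in a cycle is the treatment in row i."""
    t = len(treatments)
    if n_reps % t:
        raise ValueError("number of replicates must be a multiple of the number of treatments")
    # The t cycles, e.g. ABC, BCA, CAB for three treatments.
    cycles = [treatments[k:] + treatments[:k] for k in range(t)]
    # Each cycle is used in n_reps / t replicates, in random order.
    allocation = [c for c in cycles for _ in range(n_reps // t)]
    rng.shuffle(allocation)
    return allocation


two = crossover_plan("AB", 10, random.Random(0))     # the two-treatment case
three = crossover_plan("ABC", 6, random.Random(0))   # three treatments, 6 replicates
print(two)
print(three)
```

With two treatments this reproduces the rule that each treatment falls on the "better" member in exactly half the replicates.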


4.42 Statistical Analysis. All sums of squares are calculated in the 
usual way. For the example (table 4.10), the computations are as fol- 
lows. 


Correction factor: $\frac{(352)^2}{20} = 6195.2$

Total: $(30)^2 + (14)^2 + (21)^2 + \cdots + (23)^2 - 6195.2 = 910.8$

Columns: $\tfrac{1}{2}[(44)^2 + (42)^2 + \cdots + (47)^2] - 6195.2 = 357.8$

Rows: $\frac{(194 - 158)^2}{20} = 64.8$

Treatments: $\frac{(216 - 136)^2}{20} = 320.0$



TABLE 4.10 Cross-over experiment for comparing the speeds of two 
computing machines A and B 

Time (seconds minus 2 minutes) taken to calculate a sum of squares 

                                 Columns (replications)
Rows                1     2     3     4     5     6     7     8     9    10   Totals
First (poorer)    A 30  B 21  A 22  B 13  B 13  A 29  B  7  A 12  A 23  B 24    194
Second (better)   B 14  A 21  B  5  A 22  A 18  B 17  A 16  B 14  B  8  A 23    158
Totals              44    42    27    35    31    46    23    26    31    47    352

Treatment totals: A 216; B 136 


Analysis of variance 

Source of variation              d.f.      s.s.      m.s.
Columns (replications)             9      357.8      39.8
Rows (first versus second)         1       64.8      64.8
Treatments                         1      320.0     320.0
Error                              8      168.2      21.0
Total                             19      910.8

The error s.s. is found by subtraction. The F-ratio for treatments is 
significant at the 1% level. 

We will calculate the confidence limits for the true difference in speed 
which were quoted in section 1.2. The average observed difference is 
8.0 seconds in favor of machine B. The estimated standard error of this 
difference is $\sqrt{2(21.0)/10}$, or 2.049, based on 8 d.f. Consequently the 
confidence limits for the true difference are 

$$8.0 \pm 2.049\,t'$$

where $t'$ is the value of Student’s t corresponding to (1 - P), where P is 
the confidence probability chosen. For P = .95, $t'$ = 2.306, and the 
limits are (8.0 ± 4.7), or 3.3 and 12.7. For P = .80, $t'$ = 1.397, and, 
for P = .99, $t'$ = 3.355, giving confidence limits of (5.1, 10.9) and (1.1, 
14.9), respectively, as quoted. 
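
The cross-over analysis and the confidence limits can be verified with the Python sketch below. The data are those of table 4.10; the three values of Student's t for 8 d.f. are copied from the text rather than computed, and the variable names are illustrative. Because the error mean square is carried unrounded (21.025 rather than 21.0), the limits may differ from the quoted ones in the second decimal place.

```python
from math import sqrt

# Table 4.10: (machine, time) for the first and second computation in each replication.
first = [("A", 30), ("B", 21), ("A", 22), ("B", 13), ("B", 13),
         ("A", 29), ("B", 7), ("A", 12), ("A", 23), ("B", 24)]
second = [("B", 14), ("A", 21), ("B", 5), ("A", 22), ("A", 18),
          ("B", 17), ("A", 16), ("B", 14), ("B", 8), ("A", 23)]

r = len(first)  # replications
n = 2 * r       # observations
obs = first + second
G = sum(y for _, y in obs)
C = G ** 2 / n  # correction factor

ss_total = sum(y ** 2 for _, y in obs) - C
col_totals = [a[1] + b[1] for a, b in zip(first, second)]
ss_cols = sum(c ** 2 for c in col_totals) / 2 - C
row1, row2 = sum(y for _, y in first), sum(y for _, y in second)
ss_rows = (row1 - row2) ** 2 / n
total_A = sum(y for m, y in obs if m == "A")
total_B = G - total_A
ss_trt = (total_A - total_B) ** 2 / n
ss_error = ss_total - ss_cols - ss_rows - ss_trt  # by subtraction
ms_error = ss_error / 8                           # (t - 1)(r - 2) d.f.

diff = (total_A - total_B) / r  # mean time of A minus mean time of B
se = sqrt(2 * ms_error / r)

t_values = {0.80: 1.397, 0.95: 2.306, 0.99: 3.355}  # 8 d.f., from the text
limits = {P: (diff - t * se, diff + t * se) for P, t in t_values.items()}
for P, (lo, hi) in limits.items():
    print(P, round(lo, 1), round(hi, 1))
```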

Worked examples for the case where there are two treatments are also 
given by Brandt (4.14) and Fieller (4.15); the latter also illustrates the 
analysis of covariance for this design and the procedure when one column 
is missing. 

In general, if there are t treatments and r replicates, the degrees of 
freedom subdivide as follows. 



                   d.f.
Columns          (r - 1)
Rows             (t - 1)
Treatments       (t - 1)
Error            (t - 1)(r - 2)
Total            (tr - 1)

4.43 Standard Errors. The usual formula $\sqrt{2s^2/r}$ holds for the standard 
error of the difference between two treatment means. 

The substitution formula for a missing value is 

$$\frac{rC + t(R + T) - 2G}{(t - 1)(r - 2)}$$

where the capital letters refer respectively to the totals of the column, 
row, and treatment in which the missing value appears and to the grand 
total. The difference between the mean of the affected treatment and 
the mean of a treatment with no missing value has a standard error 

$$\sqrt{s^2\left[\frac{2}{r} + \frac{t}{r(t - 1)(r - 2)}\right]} \qquad (4.8)$$


4.5 Triple Grouping: Graeco-latin Squares 

4.51 Description. In this arrangement the treatments are grouped 
into replicates in three different ways with the consequence that the ef- 
fects of three different sources of variation are equalized for all treat- 
ments. 

TABLE 4.11 A 5 × 5 graeco-latin square 

                  Pens (weight groups)
Litters       a       b       c       d       e
I             A₁      B₃      C₅      D₂      E₄
II            B₂      C₄      D₁      E₃      A₅
III           C₃      D₅      E₂      A₄      B₁
IV            D₄      E₁      A₃      B₅      C₂
V             E₅      A₂      B₄      C₁      D₃


The additional grouping (usually represented by Greek letters) is de- 
noted here by subscripts. Each treatment (A, ..., E) appears once in 
each row and column and once with each subscript (1, ..., 5). 

One example is an arrangement proposed by Dunlop (4.16) for 
testing 5 feeding treatments on pigs. The arrangement requires 5 pigs 
from each of 5 litters (I, II, III, IV, and V), the effects of litter dif- 
ferences being equalized by making litters correspond to the rows of the 
square. The animals are housed in 5 pens (a, b, c, d, and e) which con- 
stitute the columns. The columns are used also to control differences in 
weight; i.e., the heaviest animal in each litter is assigned to the first pen, 
the next heaviest to the second pen, and so on; consequently the columns 
eliminate simultaneously the pen differences and the principal variations 
in weight within litters. Thus far the design is an ordinary latin 
square. 

The subscripts signify the positions of the five feeding crates within 
each pen. This extra control, though possibly unnecessary, was sug- 
gested for several reasons. The first and fifth crates in each pen were of 
different construction from the other crates; moreover the pigs occupying 
the end crates have less “company” when feeding. 

The graeco-latin square has not been used often, probably because the 
units can seldom be balanced conveniently in all three groupings. In the 
example discussed above, the practical difficulty was to secure five litters 
of which each contained five or more pigs. An interesting industrial ap- 
plication is described by Tippett (4.17). 

The designs have been constructed for all numbers of treatments from 
3 to 12, except 6 and 10; examples are shown in plan 4.2 (p. 120). 

4.52 Randomization. Arrange the rows and columns independently 
at random. Assign the latin letters and the subscripts at random to 
their respective classifications. 
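
A brief Python sketch can confirm that this randomization preserves the defining property of the design. The starting square is that of table 4.11, written as letter plus subscript; `is_graeco_latin` and `randomize` are names introduced for this illustration.

```python
import random

# Table 4.11: treatment letter followed by the subscript (crate position).
square = [
    ["A1", "B3", "C5", "D2", "E4"],
    ["B2", "C4", "D1", "E3", "A5"],
    ["C3", "D5", "E2", "A4", "B1"],
    ["D4", "E1", "A3", "B5", "C2"],
    ["E5", "A2", "B4", "C1", "D3"],
]


def is_graeco_latin(sq):
    """Letters and subscripts each form a latin square, and every pair occurs once."""
    r = len(sq)
    for part in (0, 1):  # 0 = letter, 1 = subscript
        rows_ok = all(len({cell[part] for cell in row}) == r for row in sq)
        cols_ok = all(len({sq[i][j][part] for i in range(r)}) == r for j in range(r))
        if not (rows_ok and cols_ok):
            return False
    return len({cell for row in sq for cell in row}) == r * r


def randomize(sq, rng=random):
    """Permute rows and columns, and relabel letters and subscripts at random."""
    r = len(sq)
    rows = rng.sample(range(r), r)
    cols = rng.sample(range(r), r)
    letters = sorted({c[0] for row in sq for c in row})
    subs = sorted({c[1] for row in sq for c in row})
    new_letter = dict(zip(letters, rng.sample(letters, r)))
    new_sub = dict(zip(subs, rng.sample(subs, r)))
    return [[new_letter[sq[i][j][0]] + new_sub[sq[i][j][1]] for j in cols] for i in rows]


plan = randomize(square, random.Random(3))
for row in plan:
    print(" ".join(row))
```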

4.53 Statistical Analysis. The sums of squares for rows, columns, 
treatments, and subscripts are all obtained in the usual way. For an 
r × r square the error has (r - 1)(r - 3) degrees of freedom. This 
number is rather inadequate when r is less than 6. 

Plan 4.1 Selected latin squares 

Plan 4.2 Graeco-latin squares 


FACTORIAL EXPERIMENTS 


5.1 Description 

5.11 A 2² Factorial Experiment. In a factorial experiment the effects 
of a number of different factors are investigated simultaneously. The 
treatments consist of all combinations that can be formed from the dif- 
ferent factors. To illustrate the simplest case, consider an experiment 
on sugar beet with 2 factors. These were nitrogen, none (n₀) versus 3 
cwt. sulphate of ammonia per acre (n₁), and depth of winter ploughing 
(7 in. versus 11 in.). Ploughing took place in late January, the nitrogen 
was applied in late April, and the seed was sown early in May. Since 
both factors occur at 2 levels or variations, the experiment is described 
as a 2 × 2 factorial experiment. The 4 treatment combinations are 
shown below, with the mean yields of sugar per acre (cwt.) underneath. 

Treatment combinations and yields of sugar (cwt. per acre) 

       1               2                3                4
  (n₀, 7 in.)     (n₁, 7 in.)     (n₀, 11 in.)     (n₁, 11 in.)
      40.9            47.8             42.4             50.2

The yields may be placed in the following 2 × 2 table. 

                           Nitrogen                   Response
Depth                    n₀       n₁       Mean        to n₁
7 in.                   40.9     47.8      44.4        +6.9
11 in.                  42.4     50.2      46.3        +7.8
Mean                    41.6     49.0      45.3
11 in. minus 7 in.      +1.5     +2.4


The results might be summarized as follows. Considering the effect 
of nitrogen, we might report that the application of nitrogen increased 
yields by 6.9 cwt. with shallow ploughing and by 7.8 cwt. with deep 
ploughing. These figures are called the simple effects of nitrogen. They 
represent the type of information that would be wanted, for instance, 
in giving advice to a farmer who always used shallow ploughing but was 
doubtful whether to apply nitrogen. For the simple effects of depth of 
ploughing, we might report that 11 in. ploughing was superior to 7 in. 
by 1.5 cwt. in the absence of nitrogen and by 2.4 cwt. when nitrogen was 
applied. 

There is another way of looking at the results. It sometimes happens 
that the effects of the factors are independent. By this we mean that the 
response to nitrogen is the same whether ploughing is shallow or deep, 
and that the difference between the effects of deep and shallow ploughing 
is the same whether nitrogen is present or not. In this event the two sim- 
ple effects of nitrogen, 6.9 cwt. and 7.8 cwt., are estimates of the same 
quantity and differ only by experimental errors. On this supposition we 
would naturally average the two figures in order to estimate the response 
to nitrogen. The average, 7.4 cwt., is called the main effect of nitrogen. 
It can be derived alternatively as the difference between the two column 
means in the table, 41.6 and 49.0. Similarly the main effect of depth of 
ploughing (11 in. minus 7 in.) is the average of 1.5 cwt. and 2.4 cwt., or 
1.9 cwt. Note that a main effect, being an average of the simple effects, 
is more precisely estimated than the latter. In this experiment the 
standard error of a main effect is $1/\sqrt{2}$ times that of a simple effect. 

Consequently, if we are sure that the factors operate independently, 
the summary that was given above in terms of simple effects may be 
replaced by another that is both more concise and more accurate. This 
might read as follows. “The application of nitrogen increased the yield 
of sugar by 7.4 cwt., while 11 in. ploughing increased the yield by 1.9 
cwt. as compared with 7 in. ploughing.” It is worth repeating that when 
the factors are independent the figure 7.4 cwt. is the best estimate not 
only of the average response to nitrogen, but also of the response on plots 
ploughed to 7 in. and of that on plots ploughed to 11 in. In other words, 
the whole of the information in the experiment is contained in the main 
effects. 

The question arises: How do we know whether the factors are inde- 
pendent? Frequently the answer is suggested by knowledge of the 
processes by which the factors produce their effects. In the present case 
an agronomist might reason that deep ploughing should enable the plant 
to develop a more vigorous root system. With this the plant should be 
able to utilize more effectively any added nutrient such as nitrogen. 
Thus he might predict that the response to nitrogen would be greater 
with deep than with shallow ploughing, though he probably would not 
expect it to be much greater. In short, he would predict that the two 
factors would not be quite independent in their effects. 

In addition to the information that may be available from such 
reasoning, a factorial experiment itself provides a test of the assumption 
of independence. For, if the depth of ploughing does affect the response 
to nitrogen, the difference between 7.8 cwt. (the response to nitrogen 
with deep ploughing) and 6.9 cwt. (the response with shallow plough- 
ing) is an estimate of this effect. The difference, 0.9 cwt., can be tested 
in the usual way by a t-test. If it proves significant, the assumption of 
independence is rejected by the data. The difference (sometimes divided 
by a numerical factor) is called the interaction between nitrogen and 
depth of ploughing. 

Interchanging the roles of the two factors, we may also consider 
whether the superiority of deep over shallow ploughing is affected by the 
presence of nitrogen. To measure the interaction in this case, we sub- 
tract 1.5 cwt. (superiority of deep ploughing when no nitrogen is added) 
from 2.4 cwt. (superiority when nitrogen is added). The difference is 
again 0.9 cwt. It is easy to see that this equality always holds with a 
2 × 2 experiment. Each difference is equal to the sum of the observa- 
tions in one diagonal of the 2 × 2 table, minus the corresponding sum 
for the other diagonal. Such interactions are called two-factor, or first- 
order, interactions. 
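
The simple effects, main effects, and interaction of the sugar-beet example can be computed in a few lines. The sketch below uses the four mean yields given above; the dictionary keys and variable names are illustrative. Unrounded, the main effects come out as 7.35 and 1.95 cwt., which the text quotes as 7.4 and 1.9.

```python
# Mean yields of sugar (cwt. per acre) for the four treatment combinations.
yields = {("n0", 7): 40.9, ("n1", 7): 47.8, ("n0", 11): 42.4, ("n1", 11): 50.2}

# Simple effects of nitrogen at each depth, and of depth at each nitrogen level.
simple_n = {d: yields[("n1", d)] - yields[("n0", d)] for d in (7, 11)}
simple_d = {n: yields[(n, 11)] - yields[(n, 7)] for n in ("n0", "n1")}

# Main effects are the averages of the simple effects.
main_n = sum(simple_n.values()) / 2
main_d = sum(simple_d.values()) / 2

# The interaction is the same whichever factor is taken first.
interaction_from_n = simple_n[11] - simple_n[7]
interaction_from_d = simple_d["n1"] - simple_d["n0"]
diagonal_difference = (yields[("n0", 7)] + yields[("n1", 11)]) - (
    yields[("n1", 7)] + yields[("n0", 11)]
)

print(simple_n, simple_d)
print(round(main_n, 2), round(main_d, 2), round(interaction_from_n, 1))
```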

5.12 Advantages of Factorial Experimentation When Factors Are 
Independent. The advantages of factorial experimentation naturally 
depend on the purpose of the experiment. We suppose for the present 
that the purpose is to investigate the effects of each factor over some pre- 
assigned range that is covered by the levels of that factor which are used 
in the experiment. In other words the object is to obtain a broad picture 
of the effects of the factors rather than to find, say, the combination of 
the levels of the factors that give a maximum response. One procedure 
is to conduct separate experiments each of which deals only with a 
single factor. Another is to include all factors simultaneously by means 
of a factorial experiment. 

If all factors are independent in their effects, the factorial approach 
will result in a considerable saving of the time and material devoted to 
the experiments. The saving follows from two facts. First, as we have 
seen, when factors are independent all the simple effects of a factor are 
equal to its main effect, so that main effects are the only quantities needed 
to describe fully the consequences of variations in the factor. Secondly, 
in a factorial experiment, each main effect is estimated with the same 
precision as if the whole experiment had been devoted to that factor 
alone. Thus, in the preceding example, half the plots receive nitrogen 
and half do not. Consequently, the main effect of nitrogen is estimated
just as precisely as it would be in a simple experiment of the same size
devoted to nitrogen alone. The same result holds for the effect of depth 
of ploughing. The two single-factor experiments would require twice the 
total number of plots in order to equal the precision obtained by the 
factorial experiment. If there were n factors, all at two levels and all 
independent, the single-factor approach would necessitate n times as 
much experimental material as a factorial arrangement of equal precision. 
The gain from factorial arrangements in this case is very substantial. 

Practical considerations may diminish this gain. The experimenter 
frequently lacks the resources to conduct a large experiment with many 
treatments, and must proceed with only one or two factors at a time. 
Further, it has been pointed out previously that, as the number of treat- 
ment combinations in an experiment is enlarged, the standard error per 
unit increases. This standard error is therefore likely to be higher for a 
large factorial experiment than for a comparable single-factor experi- 
ment. This increase in standard errors can usually be kept small by the 
device known as confounding, described in chapter 6. 

5.13 Factorial Experimentation When Factors Are Not Independent. 
We assume that the purpose is still to investigate each factor over the 
range represented by its levels. When factors are not independent, the 
simple effects of a factor vary according to the particular combination 
of the other factors with which these are produced. In this case the 
single-factor approach is likely to provide only a number of discon- 
nected pieces of information that cannot easily be put together. In 
order to conduct an experiment on a single factor A, some decision must
be made about the levels of other factors B, C, D, say, that are to be
used in the experiment (e.g., whether all plots should be ploughed 7 in.,
9 in., or 11 in. deep in an experiment on nitrogen). The experiment reveals
the effects of A for this particular combination of B, C, and D, but
no information is provided for predicting the effects of A with any other
combination of B, C, and D. With a factorial approach, on the other
hand, the effects of A are examined for every combination of B, C, and
D that is included in the experiment. Thus a great deal of information is 
accumulated both about the effects of the factors and about their inter- 
relationships. 

In this connection, Fisher (5.1) has pointed out that it is sometimes 
advisable to introduce into an experiment an extra factor that is not it- 
self of interest, in order that the experiment may form the basis for 
sounder recommendations about the other factors. In agricultural ex- 
perimentation in Britain, farmyard manure has served as a subsidiary 
factor of this kind. Any recommendations made to farmers about other 
factors will be put to the test in some fields where the farmer has applied
manure and in others where he has not, so that it is well to investigate
the other factors in both the presence and the absence of manure.

5.14 Summary Comments. To summarize, the following are some instances
where factorial experimentation may be suitable:

1. In exploratory work where the object is to determine quickly the
effects of each of a number of factors over a specified range.

2. In investigations of the interactions among the effects of several
factors. From their nature, interactions cannot be studied without testing
some of the combinations formed from the different factors. Frequently,
information is most quickly obtained by testing all combinations.

3. In experiments designed to lead to recommendations that must
apply over a wide range of conditions. Subsidiary factors may be 
brought into an experiment so as to test the principal factors under a 
variety of conditions similar to those that will be encountered in the 
population to which recommendations are to apply. 

On the other hand, if considerable information has accumulated, or 
if the object of the investigation is specialized, it may be more profitable 
to conduct intensive work on a single factor or on a few combinations of 
factors. For instance, some investigations are directed towards finding 
the combination of the levels of the factors that will produce a maximum 
response. An interesting discussion of procedures for this purpose has 
been given by Friedman and Savage (5.2). They consider the case,
common in industrial experimentation, where the effect of any specified
treatment combination can be determined quickly. Thus the treatment
combinations to be tested can be decided as the experiment proceeds, in
the light of knowledge gained from combinations that have already been
tested. They propose first a single-factor experiment to determine the
optimum level $a_1$ of A for fixed levels b, c, and d, say, of the other factors.
This is followed by a single-factor experiment to determine the optimum
level $b_1$ of B for fixed values $a_1$, c, and d. Similarly, C is tested at $a_1$, $b_1$,
d, and D at $a_1$, $b_1$, $c_1$. After the completion of the first “round,” the
whole process is repeated until the maximum appears to have been 
reached. The authors show that the maximum will usually be reached 
more quickly than with a complete factorial, since the plan is designed 
to concentrate on combinations that are near the maximum. The plan 
would not be feasible for agricultural field experiments, where in general 
an experiment can be changed only once a year. 
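
The sequential plan outlined above can be sketched in a few lines of Python. This is an illustration only, not the procedure of Friedman and Savage in detail: the response function, its maximum at (2, 1, 3, 0), the four levels per factor, and the stopping rule (stop when a complete round changes nothing) are all assumptions made for the example.

```python
def one_factor_at_a_time(response, levels, start, max_rounds=10):
    """Cycle through the factors, moving each in turn to its best level.

    Returns the final combination and the number of combinations tested.
    """
    current = list(start)
    tested = 0
    for _ in range(max_rounds):
        before = tuple(current)
        for i, factor_levels in enumerate(levels):
            # Single-factor experiment: vary factor i, hold the others fixed.
            current[i] = max(
                factor_levels,
                key=lambda lv: response(tuple(current[:i] + [lv] + current[i + 1:])),
            )
            tested += len(factor_levels)
        if tuple(current) == before:  # a complete round changed nothing
            break
    return tuple(current), tested


def response(x):
    """Hypothetical response with an A-by-B interaction; maximum at (2, 1, 3, 0)."""
    a, b, c, d = x
    return -((a - 2) ** 2 + (a - 2) * (b - 1) + (b - 1) ** 2 + (c - 3) ** 2 + d ** 2)


levels = [range(4)] * 4  # four factors, each at levels 0, 1, 2, 3
best, tested = one_factor_at_a_time(response, levels, start=(0, 0, 0, 0))
print(best, tested)
```

In this artificial case the search settles after two rounds, having tested far fewer combinations than the 256 of the complete $4^4$ factorial. With stronger interactions more rounds would be needed, and the plan may stop short of the true maximum.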

Experimenters sometimes find the results of factorial experiments 
difficult to interpret, because they appear to present a bewildering variety
of treatment comparisons. It is true that the competent summary
of a large factorial experiment demands an orderly procedure and often 
takes considerable time. If the factors are for the most part independent, 
the method of analysis by means of main effects and interactions (to be 
illustrated later) will reduce the data to manageable proportions. If 
the numerous factors interact in a puzzling manner, prolonged study of 
the results and further experimentation may be needed before the facts 
are mastered. The trouble in this case is that the phenomena are com- 
plex, not that the experimentation is faulty. 


5.2 Calculation of Main Effects and Interactions 

5.21 Notation for the $2^n$ Series. The object of section 5.2 is to explain
how main effects and interactions are calculated, and how they are
represented in the analysis of variance of the results. We begin with the
$2^n$ series, where each factor occurs at only two levels.

For this system the notation used is similar to that of Yates (5.3).
Letters A, B, C, ... denote the factors. The letters a, b, c, ... denote
one of the two levels at which the corresponding factor occurs; for purposes
of clarity this level will be called the second level. The first level
is signified by absence of the corresponding letter. Thus the treatment
combination bd, in a $2^4$ factorial experiment, means the treatment combination
which contains the first levels of factors A and C, and the second
levels of factors B and D. The treatment combination which consists of
the first level of all factors is denoted by the symbol (1).

The symbol (ab) will denote the mean of all observations which receive
the treatment combination ab. The letters A, B, and AB, when they
refer to numbers, will represent, respectively, the main effects of A and
B and the A by B interaction.
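
The notation lends itself to mechanical generation. The following Python sketch lists the treatment combinations of a $2^n$ experiment and decodes a label into factor levels; the function names and the order of listing are choices made for this illustration, not part of the notation.

```python
def treatment_combinations(factors):
    """Return the 2^n treatment labels, e.g. ['(1)', 'a', 'b', 'ab', ...]."""
    labels = []
    for k in range(2 ** len(factors)):
        # Bit i of k is 1 when factor i is at its second level.
        letters = "".join(f for i, f in enumerate(factors) if (k >> i) & 1)
        labels.append(letters or "(1)")  # no letters at all is written (1)
    return labels


def levels(label, factors):
    """Map each factor (upper case) to 'first' or 'second' for one label."""
    present = "" if label == "(1)" else label
    return {f.upper(): ("second" if f in present else "first") for f in factors}


combos = treatment_combinations("abcd")
print(combos[:8])
print(levels("bd", "abcd"))
```

For the $2^4$ experiment the label bd decodes to the first levels of A and C and the second levels of B and D, as stated in the text.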

5.22 The $2^2$ Factorial Experiment. As already shown in section 5.11,
we have

$$A = \tfrac{1}{2}[(ab) - (b) + (a) - (1)]$$

$$B = \tfrac{1}{2}[(ab) + (b) - (a) - (1)]$$

Yates (5.3) introduces the same multiplier for the interaction, which
he defines as

$$AB = \tfrac{1}{2}[(ab) - (b) - (a) + (1)]$$

These quantities and the general mean M are shown in terms of the 
means for the treatment combinations in table 5.1. 

TABLE 5.1 Main effects and interactions expressed in terms of
individual treatment means: $2^2$ factorial

Factorial          Treatment combination
effect         (1)     (a)     (b)     (ab)     Divisor

M               +       +       +       +          4
A               -       +       -       +          2
B               -       -       +       +          2
AB              +       -       -       +          2


The rows of the table express the factorial effects in terms of the
original means. If the equations represented by the table are solved for
(ab), (b), etc., in terms of M, A, etc., it will be found that the columns
of the table enable us to express the original means in terms of the
factorial effects. For example, from the column for (ab),

$$(ab) = M + \tfrac{1}{2}[A + B + AB]$$

The only point to remember is that the factor $\tfrac{1}{2}$ occurs with all terms
except M. From these results, simple effects may be calculated from
factorial effects. Thus

$$(a) - (1) = \text{simple effect of } A \text{ when } B \text{ is at the first level} = A - AB$$

$$(ab) - (b) = \text{simple effect of } A \text{ when } B \text{ is at the second level} = A + AB$$

These relations are useful when an experiment has been summarized in 
terms of the factorial effects, and it is later desired to estimate some of 
the simple effects. From the above example it may be noted that the 
quantity AB measures the error that is committed in estimating the 
simple effects of A if the two factors are erroneously assumed to be in- 
dependent. 
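
As a check on these relations, the sketch below applies the rows of table 5.1 to four treatment means and then uses the columns to recover the means. The four means are hypothetical; the signs and divisors are those of table 5.1.

```python
from fractions import Fraction

# Hypothetical treatment means, keyed by treatment combination.
means = {"(1)": 20, "a": 24, "b": 23, "ab": 33}

# Rows of table 5.1: signs for (1), (a), (b), (ab), followed by the divisor.
order = ["(1)", "a", "b", "ab"]
table = {
    "M": ([+1, +1, +1, +1], 4),
    "A": ([-1, +1, -1, +1], 2),
    "B": ([-1, -1, +1, +1], 2),
    "AB": ([+1, -1, -1, +1], 2),
}

# Reading along the rows: factorial effects from treatment means.
effects = {
    name: Fraction(sum(s * means[t] for s, t in zip(signs, order)), divisor)
    for name, (signs, divisor) in table.items()
}


def mean_from_effects(treatment):
    """Reading down a column: M plus one half of the signed effects."""
    col = order.index(treatment)
    signed = sum(table[e][0][col] * effects[e] for e in ("A", "B", "AB"))
    return effects["M"] + signed / 2


print(effects)
print({t: mean_from_effects(t) for t in order})
```

With these figures A = 7 and AB = 3, so the simple effects of A are A − AB = 4 at the first level of B and A + AB = 10 at the second, which agree with the differences 24 − 20 and 33 − 23 taken directly from the means.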

The functions A, B, and AB satisfy the conditions for an orthogonal
set of functions (section 3.42). Consequently the squares of the factorial
effects, when suitably multiplied, divide the treatments s.s. into
three single components, each with 1 d.f. In practice these components
will usually be calculated from the treatment totals [ab], [b], etc., rather
than from means. If we define factorial effect totals as illustrated below,

$$[A] = [ab] - [b] + [a] - [1]$$

then the sum of squares for A in the analysis of variance is $[A]^2/4r$, where
r is the number of replicates. Corresponding formulae hold for the other
factorial effects.
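
The subdivision of the treatments s.s. can be verified on a small example. In the Python sketch below the treatment totals and the number of replicates are hypothetical; the treatments s.s. is computed in the ordinary way and compared with the three single components.

```python
from fractions import Fraction

r = 5  # hypothetical number of replicates
totals = {"(1)": 100, "a": 120, "b": 115, "ab": 165}  # hypothetical treatment totals

# Factorial effect totals, formed with the signs of table 5.1.
effect_totals = {
    "A": totals["ab"] - totals["b"] + totals["a"] - totals["(1)"],
    "B": totals["ab"] + totals["b"] - totals["a"] - totals["(1)"],
    "AB": totals["ab"] - totals["b"] - totals["a"] + totals["(1)"],
}

# Each component of the treatments s.s. is [effect total]^2 / 4r, with 1 d.f.
components = {name: Fraction(t ** 2, 4 * r) for name, t in effect_totals.items()}

# Treatments s.s. computed directly from the four totals (3 d.f.).
grand = sum(totals.values())
ss_treatments = Fraction(sum(t ** 2 for t in totals.values()), r) - Fraction(grand ** 2, 4 * r)

print(components, ss_treatments)
```

Here the components are 245, 180, and 45, which add to the treatments s.s. of 470.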

5.23 The $2^3$ Factorial Experiment. In this case there are eight treatment
combinations: (1), a, b, c, ab, ac, bc, and abc. The simple effect of
A is determined for each of four combinations of the other factors: (1),
b, c, and bc. As before, the main effect of A is defined to be the average
of these four simple effects.

$$A = \tfrac{1}{4}[(abc) - (bc) + (ab) - (b) + (ac) - (c) + (a) - (1)]$$

Similar definitions hold for B and C.

The interaction of A with B is now measured separately at each of the
two levels of C.

$$AB \ (C \text{ at second level}) = \tfrac{1}{2}[(abc) - (bc) - (ac) + (c)]$$

$$AB \ (C \text{ at first level}) = \tfrac{1}{2}[(ab) - (b) - (a) + (1)]$$

As would be expected, the quantity AB is taken as the average of these
two effects. Thus

$$AB = \tfrac{1}{4}[(abc) - (bc) - (ac) + (c) + (ab) - (b) - (a) + (1)]$$

There are two other first-order interactions, AC and BC, which are defined
similarly.

In addition, we encounter a new interaction. Separate estimates were
given above for AB at each of the two levels of C. The difference between
these two estimates measures the effect of C on the AB interaction.
This difference, with the conventional factor $\tfrac{1}{2}$, is

$$\tfrac{1}{4}[(abc) - (bc) - (ac) + (c) - (ab) + (b) + (a) - (1)]$$

and may be called the interaction of AB with C. If the algebra is carried
out, it will be found that the same expression measures the interaction
of AC with B, and that of BC with A. Hence the quantity is called the
ABC interaction. It is a three-factor, or second-order, interaction.
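
The statement that the three ways of forming the ABC interaction agree can be checked numerically. The sketch below uses eight hypothetical treatment means; the function names are illustrative only.

```python
from fractions import Fraction

# Hypothetical treatment means, indexed by the levels (0 or 1) of A, B, C.
y = {
    (0, 0, 0): 20, (1, 0, 0): 24, (0, 1, 0): 23, (1, 1, 0): 33,
    (0, 0, 1): 21, (1, 0, 1): 30, (0, 1, 1): 22, (1, 1, 1): 41,
}


def two_factor(i, j, k, level_k):
    """Interaction of factors i and j (multiplier 1/2) with factor k held fixed."""
    def cell(level_i, level_j):
        combo = [0, 0, 0]
        combo[i], combo[j], combo[k] = level_i, level_j, level_k
        return y[tuple(combo)]

    return Fraction(cell(1, 1) - cell(0, 1) - cell(1, 0) + cell(0, 0), 2)


def three_factor(i, j, k):
    """Effect of factor k on the i-by-j interaction, with the conventional 1/2."""
    return (two_factor(i, j, k, 1) - two_factor(i, j, k, 0)) / 2


A, B, C = 0, 1, 2
ab_with_c = three_factor(A, B, C)
ac_with_b = three_factor(A, C, B)
bc_with_a = three_factor(B, C, A)
print(ab_with_c, ac_with_b, bc_with_a)
```

All three quantities equal 1 for these figures, although the two-factor interactions from which they are built differ from one pair of factors to another.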

Three-factor interactions are more difficult to understand than two- 
factor interactions. Fortunately, in practice three-factor interactions 
are often small relative to main effects and two-factor interactions; 
and quite frequently they can be neglected for the purposes to which the 
results are to be put. Occasionally, cases arise where they are important. 
It might happen, for instance, that factor A does not exert any influence 
unless factors B and C are present in the combination (bc). In this event
the interaction ABC is as large as the main effect of A or the interactions
AB and AC. The same effect may occur in less extreme cases, where the
combination bc is particularly favorable to the response to factor A. If
three-factor interactions are found to be substantial, a careful scrutiny 
of the simple effects is usually helpful in interpretation. 

The expressions for the factorial effects in terms of the treatment
means are summarized in table 5.2. The rows of the table give the
factorial effects in terms of the treatment means, while the columns give
the treatment means in terms of the factorial effects.

TABLE 5.2 Main effects and interactions expressed in terms of
individual treatment means: $2^3$ factorial

Factorial                     Treatment combination
effect       (1)    (a)    (b)    (ab)    (c)    (ac)    (bc)    (abc)    Divisor

M             +      +      +      +       +      +       +       +          8
A             -      +      -      +       -      +       -       +          4
B             -      -      +      +       -      -       +       +          4
C             -      -      -      -       +      +       +       +          4
AB            +      -      -      +       +      -       -       +          4
AC            +      -      +      -       -      +       -       +          4
BC            +      +      -      -       -      -       +       +          4
ABC           -      +      +      -       +      -       -       +          4

For example,

$$(a) = M + \tfrac{1}{2}[A - B - C - AB - AC + BC + ABC]$$

$$(a) - (1) = A - AB - AC + ABC$$

$$(abc) - (1) = A + B + C + ABC$$

As before, the factor $\tfrac{1}{2}$ appears with all terms except M in the expression
for a treatment mean. The difference between two treatment means, of
which two examples are given above, is easily found by noting the signs
in the two columns in question.

As the reader may verify, the 7 factorial effects are mutually orthogonal,
and each is orthogonal to M. If r is the number of replicates, each
factorial effect has the same variance, $\frac{8\sigma^2}{16r}$, or $\frac{\sigma^2}{2r}$. The contribution of
any effect to the sum of squares for treatments is $[\ ]^2/8r$, where [ ]
denotes the factorial effect total; e.g.,

$$[A] = -[1] + [a] - [b] + [ab] - [c] + [ac] - [bc] + [abc]$$
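
The verification of orthogonality left to the reader may be carried out mechanically. In the sketch below the signs are generated from a multiplication rule (each factor in the effect contributes +1 if its letter is present in the treatment and −1 if absent). This rule is a convenience assumed for the illustration; it reproduces the rows of table 5.2.

```python
from itertools import combinations

COMBOS = ["", "a", "b", "ab", "c", "ac", "bc", "abc"]  # "" stands for (1)
EFFECTS = ["A", "B", "C", "AB", "AC", "BC", "ABC"]


def sign(effect, combo):
    """Sign (+1 or -1) attached to treatment `combo` in factorial effect `effect`."""
    s = 1
    for letter in effect.lower():
        s *= 1 if letter in combo else -1
    return s


rows = {e: [sign(e, c) for c in COMBOS] for e in EFFECTS}

# Orthogonal to M: the signs in every row add to zero.
sums_to_zero = all(sum(row) == 0 for row in rows.values())

# Mutually orthogonal: the sum of products of any two rows is zero.
mutually_orthogonal = all(
    sum(x * y for x, y in zip(rows[e1], rows[e2])) == 0
    for e1, e2 in combinations(EFFECTS, 2)
)

for e in EFFECTS:
    print(f"{e:4s}", " ".join("+" if s > 0 else "-" for s in rows[e]))
print(sums_to_zero, mutually_orthogonal)
```

Since each row contains eight coefficients of ±1, the sum of their squares is 8, which is the origin of the divisor 8r in the sum of squares.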

5.24 The $2^n$ Factorial Experiment. A few formulae will be given
which apply to any number of factors, all at two levels. With n factors,
there are n main effects, n(n − 1)/2 two-factor interactions,
n(n − 1)(n − 2)/6 three-factor interactions, and so on. The successive
numbers are the coefficients in the expansion of $(1 + 1)^n$, omitting the
first coefficient, unity.
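
The counts are binomial coefficients and are readily tabulated. A short sketch follows; the function name is illustrative.

```python
from math import comb


def effect_counts(n):
    """Numbers of main effects, two-factor, three-factor, ... interactions."""
    return [comb(n, k) for k in range(1, n + 1)]


counts = effect_counts(6)
print(counts, sum(counts))
```

For six factors there are 6 main effects, 15 two-factor, 20 three-factor, 15 four-factor, and 6 five-factor interactions, and 1 six-factor interaction, or 63 factorial effects in all, one fewer than the 64 treatment combinations.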

Two equivalent methods for writing out any factorial effect in terms 
of the original treatment means are available. Each is simple to re- 
member, though the actual writing may take some time in a large experiment.
They will be illustrated by finding the BCDE interaction in
a $2^6$ experiment, with factors A, B, C, D, E, and F.

Rule 1. Evens versus odds. In every factorial effect, half the treatment
combinations receive a + sign and half a − sign. Those which receive
a + sign are those which contain an even number of the letters that appear
in the factorial effect. (This form of the rule applies to an effect
with an even number of letters, such as BCDE; for an effect with an odd
number of letters, such as A or ABC in tables 5.1 and 5.2, the + sign
goes instead to the combinations that contain an odd number of its
letters.) In the BCDE interaction 4 letters, b, c, d,
and e, appear. There are 8 combinations that contain an even number of
letters: (1), bc, bd, be, cd, ce, de, and bcde. Each of these can be combined
with any one of 4 combinations of the remaining letters a and f, namely,
(1), a, f, and af. Hence, the 32 terms that appear with a + sign are as
follows.

    (1)      a        f        af
    bc       abc      bcf      abcf
    bd       abd      bdf      abdf
    be       abe      bef      abef
    cd       acd      cdf      acdf
    ce       ace      cef      acef
    de       ade      def      adef
    bcde     abcde    bcdef    abcdef

The 32 combinations with a − sign are those which have an odd number
of the letters b, c, d, or e. In detail, these are

    b        ab       bf       abf
    c        ac       cf       acf
    d        ad       df       adf
    e        ae       ef       aef
    bcd      abcd     bcdf     abcdf
    bce      abce     bcef     abcef
    bde      abde     bdef     abdef
    cde      acde     cdef     acdef

When calculated from the treatment means, this difference between the
sums for the two groups of 32 combinations is divided by $2^{n-1}$ to give the
BCDE effect.
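
Rule 1 is easily programmed. The sketch below divides the 64 treatment combinations of the $2^6$ experiment into the two groups for the BCDE interaction. The function names are illustrative, and the parity test is written so as to cover effects with an odd number of letters as well, which is an extension assumed here for generality.

```python
def treatment_combinations(factors):
    """Return the 2^n treatment labels, with (1) for the first level of all factors."""
    labels = []
    for k in range(2 ** len(factors)):
        letters = "".join(f for i, f in enumerate(factors) if (k >> i) & 1)
        labels.append(letters or "(1)")
    return labels


def evens_versus_odds(effect, factors):
    """Split the treatment combinations into those with a + sign and a - sign."""
    plus, minus = [], []
    for combo in treatment_combinations(factors):
        letters = "" if combo == "(1)" else combo
        shared = sum(1 for x in effect.lower() if x in letters)
        # For an even-lettered effect this is '+' when `shared` is even;
        # for an odd-lettered effect it is '+' when `shared` is odd.
        if (len(effect) - shared) % 2 == 0:
            plus.append(combo)
        else:
            minus.append(combo)
    return plus, minus


plus, minus = evens_versus_odds("BCDE", "abcdef")
print(len(plus), plus)
print(len(minus), minus)
```

The two lists reproduce the two groups of 32 combinations written out above, though in a different order.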

Rule 2. Algebraic. In this rule the BCDE interaction is expressed
formally as

$$BCDE = \frac{1}{2^{n-1}}(a + 1)(b - 1)(c - 1)(d - 1)(e - 1)(f + 1)$$

Note that − signs appear with the factors that enter into the interaction
and + signs with those that do not. If this expression is expanded
algebraically, it gives the interaction as a linear function of the treatment
means.
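
The formal expansion of Rule 2 can also be carried out by machine. In the following sketch each bracket is multiplied in turn, the products of letters being kept as treatment labels (with the empty label standing for (1)). The representation of the expansion as a dictionary of coefficients is a choice made for this illustration.

```python
def expand(effect, factors):
    """Expand the product of (x - 1) for factors in the effect and (x + 1) otherwise.

    Returns a dictionary {treatment label: coefficient}, "" standing for (1).
    """
    terms = {"": 1}
    for x in factors:
        constant = -1 if x.upper() in effect else 1
        new_terms = {}
        for label, coef in terms.items():
            # Multiply the term by the letter x ...
            new_terms[label + x] = new_terms.get(label + x, 0) + coef
            # ... and by the constant +1 or -1 of the same bracket.
            new_terms[label] = new_terms.get(label, 0) + coef * constant
        terms = new_terms
    return terms


coefficients = expand("BCDE", "abcdef")

# Comparison with Rule 1: + for an even number of the letters b, c, d, e.
agrees_with_rule_1 = all(
    coef == (1 if sum(x in label for x in "bcde") % 2 == 0 else -1)
    for label, coef in coefficients.items()
)
print(len(coefficients), agrees_with_rule_1)
```

Every one of the 64 treatment combinations appears once, with coefficient +1 or −1, and the signs agree with those given by Rule 1. The divisor $2^{n-1}$ is applied afterwards.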

As in the $2^2$ and $2^3$ cases, all factorial effects are orthogonal to one
another and to the mean. The contribution of any effect to the sum of
squares in the analysis of variance is $[\ ]^2/2^n r$, where [ ] denotes the
effect total.

5.25 Factors at More than Two Levels: a 4 × 2 Factorial. Various
notations are used. For example, the 3 levels of a factor A may be denoted
by $a_0$, $a_1$, $a_2$, or by $a_1$, $a_2$, $a_3$, or simply by the numbers 0, 1, 2. For
illustration, we give below the treatment totals in a 4 × 2 experiment
on sugar cane. The treatments were 4 levels of dressing of potash, $k_0$,
$k_1$, $k_2$, and $k_3$, in arithmetic progression, and 2 levels of phosphate, $p_0$
and $p_1$. There were 5 replicates in randomized blocks.


TABLE 5.3 Total yields of sugar (tons per acre)

             k_0     k_1     k_2     k_3     Total

p_0          180     248     277     285       990
p_1          251     307     342     346      1246

Totals       431     555     619     631      2236


The main effect of phosphate (P), which occurs at only 2 levels, is of
course derived from the comparison of the 2 marginal totals, 990 and
1246. For the main effect of potash (K) there are 4 marginal totals
which may be compared. It will be recalled (section 3.42) that 3 independent
comparisons may be made amongst 4 totals, and that an infinite
number of such sets of 3 may be chosen. Thus the main effect of
K comprises 3 independent comparisons.

With the $2^n$ system, we were able to define every main effect as a specific
linear combination of the treatment means. In the present case we
could select a particular set of 3 independent comparisons, each of which
would be a specific linear combination of the treatment means. These
could be defined as the “components” of the main effect of K. However,
the particular set that is most useful for the interpretation of the results
will change from experiment to experiment, so that a formal definition
of this type would be of limited utility. The experimenter should use
whichever set appears most relevant.

Now consider the interaction between P and K. From the differences
between the two rows in table 5.3, the effect of P is estimated separately
at each level of K, the estimates being

                   k_0     k_1     k_2     k_3

(p_1 - p_0)         71      59      65      61

Any comparison among these 4 figures is a measure of the effect of K on
the response to P, and therefore is a component of the interaction between
P and K. Consequently the interaction between P and K consists
of 3 independent comparisons, and is said to have 3 d.f. As with
the main effect of K, the particular set of 3 components that will be of
interest varies with the type of experiment.

We may also wish to consider the interaction as the effect of P on the
response to K. As we have seen, the response to K has 3 components.
Since there are 4 increasing levels of potash dressing, we might choose as
components the linear, quadratic, and cubic components of the response
curve. Apart from a divisor, the linear component is

$$-3(180) - 1(248) + 1(277) + 3(285) = 344$$

at the lower level of P and

$$-3(251) - 1(307) + 1(342) + 3(346) = 320$$

at the higher level of P. The difference between these 2 figures, −24,
estimates the effect of P on the linear response to K, and is a part of the
interaction between K and P. The other two components are the effects
of P on the quadratic and cubic responses to K.

In a $2^n$ system, the interaction between P and K is identical with the
interaction between K and P. In the more general case the corresponding
result is that any component of the interaction between K and P can
be derived from the components of the interaction between P and K.
For example, the effect of P on the linear response to K can be written

$$-24 = -3(71) - 1(59) + 1(65) + 3(61)$$

so that it is a linear function of the responses to P at the 4 levels of K.
In this more general sense, the two interactions are still equivalent.
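
The equivalence may be confirmed directly from the totals of table 5.3. The sketch below computes the effect of P on the linear response to K in the two ways described; the variable names are illustrative.

```python
# Treatment totals from table 5.3, at the four levels of K.
totals = {"p0": [180, 248, 277, 285], "p1": [251, 307, 342, 346]}
LINEAR = [-3, -1, 1, 3]  # multipliers for the linear component, 4 levels


def contrast(coefficients, values):
    """Linear function of the values with the given coefficients."""
    return sum(c * v for c, v in zip(coefficients, values))


# First view: linear response to K at each level of P, then the difference.
linear_at_p0 = contrast(LINEAR, totals["p0"])
linear_at_p1 = contrast(LINEAR, totals["p1"])
effect_of_p_on_linear_k = linear_at_p1 - linear_at_p0

# Second view: response to P at each level of K, then the linear function.
responses_to_p = [hi - lo for lo, hi in zip(totals["p0"], totals["p1"])]
linear_in_responses_to_p = contrast(LINEAR, responses_to_p)

print(linear_at_p0, linear_at_p1, effect_of_p_on_linear_k)
print(responses_to_p, linear_in_responses_to_p)
```

Both routes give −24, since each is the same linear function of the 8 treatment totals.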

We will consider the analysis of variance of the 8 treatment totals in
some detail. With 5 replicates, the total s.s., on a single-plot basis, is

$$\frac{(180)^2 + (251)^2 + (248)^2 + \cdots + (346)^2}{5} - \frac{(2236)^2}{40} = 4165.2$$

The 7 d.f. subdivide into 1 for the main effect of P, 3 for that of K, and
3 for the PK interaction. In practice the first two terms are calculated
directly and the interaction obtained by subtraction. The computations
are

$$P: \quad \frac{(1246 - 990)^2}{40} = 1638.4$$

$$K: \quad \frac{(431)^2 + (555)^2 + (619)^2 + (631)^2}{10} - \frac{(2236)^2}{40} = 2518.4$$


TABLE 5.4 Preliminary analysis of variance of data in table 5.3

            d.f.        s.s.        m.s.

P             1       1638.4      1638.4
K             3       2518.4       839.5
PK            3          8.4         2.8

Total         7       4165.2


As is common in agricultural experiments, the interaction m.s. is small 
compared with those for the main effects (in fact it is below the mean 
square for error). 
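
The computations leading to table 5.4 may be reproduced as follows. This is a sketch using exact fractions; it covers only the treatment totals of table 5.3, since the block totals and the error s.s. are not given in this section.

```python
from fractions import Fraction

# Treatment totals from table 5.3 (rows p0, p1; columns k0..k3); 5 replicates.
totals = [[180, 248, 277, 285],
          [251, 307, 342, 346]]
r = 5

cells = [t for row in totals for t in row]
grand = sum(cells)                               # 2236
correction = Fraction(grand ** 2, r * len(cells))  # (2236)^2 / 40

# Total s.s. among the 8 treatments, on a single-plot basis (7 d.f.).
ss_total = Fraction(sum(t * t for t in cells), r) - correction

# Main effects from the marginal totals.
row_totals = [sum(row) for row in totals]         # 990, 1246 (20 plots each)
col_totals = [sum(col) for col in zip(*totals)]   # 431, 555, 619, 631 (10 plots each)
ss_p = Fraction(sum(t * t for t in row_totals), r * 4) - correction
ss_k = Fraction(sum(t * t for t in col_totals), r * 2) - correction

# Interaction by subtraction.
ss_pk = ss_total - ss_p - ss_k

for name, ss, df in [("P", ss_p, 1), ("K", ss_k, 3), ("PK", ss_pk, 3)]:
    print(name, df, float(ss), round(float(ss / df), 1))
print("Total", 7, float(ss_total))
```

The s.s. for P obtained from the two marginal totals in this way is the same as $(1246 - 990)^2/40$.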

As an exercise we will divide the K main effect and its interaction with
P into linear, quadratic, and cubic components. The first step is to calculate
the totals for these effects separately at each level of P. The results
are shown below, the multipliers for the 4 levels of K being shown in
parentheses. These multipliers are obtained from a table of orthogonal
polynomials (5.9).



                          K_l                 K_q                 K_c
                   (-3, -1, +1, +3)    (+1, -1, -1, +1)    (-1, +3, -3, +1)

p_0                      +344                - 60                +18
p_1                      +320                - 52                -10

Sum (p_1 + p_0)          +664                -112                + 8
Diff. (p_1 - p_0)        - 24                +  8                -28

From the sum line, we obtain the 3 components of the K main effect,

$$K_l = \tfrac{1}{200}(664)^2 = 2204.5, \quad K_q = \tfrac{1}{40}(112)^2 = 313.6, \quad K_c = \tfrac{1}{200}(8)^2 = 0.3$$

Similarly, from the difference line, the components of the PK interaction
are

$$PK_l = \tfrac{1}{200}(24)^2 = 2.9, \quad PK_q = \tfrac{1}{40}(8)^2 = 1.6, \quad PK_c = \tfrac{1}{200}(28)^2 = 3.9$$

The divisors are found from the usual rule for linear functions (section
3.42). The three K components add to 2518.4, the total s.s. for K, while
the PK components add to 8.4, the sum of squares for PK. The detailed
analysis of variance is shown in table 5.5.

TABLE 5.5 More detailed analysis of variance of the data
in table 5.3

            d.f.      s.s. or m.s.

P             1          1638.4
K_l           1          2204.5
K_q           1           313.6
K_c           1             0.3
PK_l          1             2.9
PK_q          1             1.6
PK_c          1             3.9

Since every component has 1 d.f., the mean squares are the same as 
the sums of squares. The error m.s. is 10.35, with 28 d.f. The quadratic
component of K is significant, indicating a falling off in the response at 
the higher levels of application. Neither the cubic component nor any 
of the interactions approaches significance. 
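
The subdivision into single components may be reproduced as follows. The multipliers are those for 4 equally spaced levels quoted above; the divisor for each component is taken as (plots per total) × (number of totals at each level of K) × (sum of squares of the multipliers), which is the rule for linear functions as it is understood here.

```python
from fractions import Fraction

totals = {"p0": [180, 248, 277, 285], "p1": [251, 307, 342, 346]}  # table 5.3
r = 5  # plots per treatment total
MULTIPLIERS = {
    "l": [-3, -1, 1, 3],   # linear
    "q": [1, -1, -1, 1],   # quadratic
    "c": [-1, 3, -3, 1],   # cubic
}


def contrast(coefficients, values):
    return sum(c * v for c, v in zip(coefficients, values))


components = {}
for name, m in MULTIPLIERS.items():
    at_p0 = contrast(m, totals["p0"])
    at_p1 = contrast(m, totals["p1"])
    # Each total is a sum over r plots, and two totals enter at each level of K.
    divisor = r * 2 * sum(c * c for c in m)
    components["K" + name] = Fraction((at_p1 + at_p0) ** 2, divisor)
    components["PK" + name] = Fraction((at_p1 - at_p0) ** 2, divisor)

ss_k = sum(v for k, v in components.items() if k.startswith("K"))
ss_pk = sum(v for k, v in components.items() if k.startswith("PK"))

for name, ss in components.items():
    print(name, round(float(ss), 1))
print(float(ss_k), float(ss_pk))
```

The divisors come out as 200 for the linear and cubic components and 40 for the quadratic, and the components add to 2518.4 for K and 8.4 for PK, in agreement with table 5.4.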

5.26 A $3^2$ Factorial Experiment. As a second example, the treatment
totals are shown in table 5.6 for an experiment with 3 levels of nitrogen
fertilizer and 3 of phosphate fertilizer. The data are the numbers of
lettuce plants that emerged from the ground and are totals over 12 plots
each. Both nitrogen and phosphate appear to have had a deleterious
effect on emergence (the subscript 2 denotes the largest application).

TABLE 5.6 Numbers of lettuce plants emerging *

               n_0           n_1           n_2         Totals

p_0         449 (Aα)      413 (Cβ)      326 (Bγ)        1188
p_1         409 (Bβ)      358 (Aγ)      291 (Cα)        1058
p_2         341 (Cγ)      278 (Bα)      312 (Aβ)         931

Totals        1199          1049           929          3177

* The use of the latin and greek letters will be explained later.

The main effects of N and P both comprise two independent comparisons,
and thus have 2 d.f. each. Since the amounts of N and P were
in arithmetic progression, it would probably again be appropriate, as in
the previous example, to choose the linear and quadratic components of
the regression on amount of dressing as the individual components of the
main effects.

The $N_l$ component can be estimated separately at each level of P.
This means that there are 2 d.f. which measure the effect of P on the
linear response to N. These are a part of the NP interaction. Another
2 d.f. are supplied by the effect of P on the $N_q$ component, so that the
NP interaction contains 4 d.f.

From inspection of the margins of table 5.6, it is evident that the main
effects of both N and P are approximately linear. Consequently, the
most interesting single degree of freedom from the interactions is likely
to be the interaction of $N_l$ with $P_l$. In many agricultural experiments
this type of interaction is the only one that approaches significance. It
is worth while calculating this interaction separately in the analysis of
variance. The other three components, $N_lP_q$, $N_qP_l$, and $N_qP_q$, will also
be obtained. The initial computations appear in table 5.7.


TABLE 5.7 Calculation of linear and quadratic effects for the
analysis of variance

             N_l              N_q                          P_l              P_q
          (-1, 0, 1)       (1, -2, 1)                   (-1, 0, 1)       (1, -2, 1)

p_0         -123             - 51             n_0         -108             -28
p_1         -118             - 16             n_1         -135             -25
p_2         - 29             + 97             n_2         - 14             +56

Sum         -270 (N_l)       + 30 (N_q)       Sum         -257 (P_l)       + 3 (P_q)
P_l         + 94 (N_lP_l)    +148 (N_qP_l)    N_l         + 94 (P_lN_l)    +84 (P_qN_l)
P_q         + 84 (N_lP_q)    + 78 (N_qP_q)    N_q         +148 (P_lN_q)    +78 (P_qN_q)

The left side of the table shows the $N_l$ and $N_q$ effects for each level of
P, while the right side shows the $P_l$ and $P_q$ effects for each level of N.
For instance, from table 5.6,

$$N_l p_0 = 326 - 449 = -123; \quad P_q n_0 = 449 - 2(409) + 341 = -28$$

The column sums give the individual components of the N and P main
effects. Now consider the difference between the third and the first row.
For the $N_l$ column, this difference (+94) gives the linear effect of P on
$N_l$: i.e., the $N_lP_l$ interaction. From the $P_l$ column, the difference gives
the linear effect of N on $P_l$, or the $P_lN_l$ interaction, which is exactly the
same as the $N_lP_l$ interaction. The other two columns provide the $N_qP_l$
and the $N_lP_q$ effects.

Finally, the sum of the first and third rows, minus twice the second
row, leads to the components of interaction that contain a quadratic
term. It will be seen that all four components can be obtained from
either the left or the right half of the table, so that in practice only one
half is required. By computing both halves we verify the symmetry of
the components with respect to N and P.
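
The two halves of table 5.7 can be computed from table 5.6 as in the sketch below, which also confirms the symmetry. The variable names are illustrative.

```python
# Treatment totals from table 5.6: rows p0, p1, p2; columns n0, n1, n2.
cells = [[449, 413, 326],
         [409, 358, 291],
         [341, 278, 312]]
LIN, QUAD = (-1, 0, 1), (1, -2, 1)


def contrast(coefficients, values):
    return sum(c * v for c, v in zip(coefficients, values))


# Left half: N_l and N_q at each level of P (contrasts along the rows).
n_l = [contrast(LIN, row) for row in cells]
n_q = [contrast(QUAD, row) for row in cells]
left = {
    "NlPl": contrast(LIN, n_l), "NqPl": contrast(LIN, n_q),
    "NlPq": contrast(QUAD, n_l), "NqPq": contrast(QUAD, n_q),
}

# Right half: P_l and P_q at each level of N (contrasts down the columns).
columns = list(zip(*cells))
p_l = [contrast(LIN, col) for col in columns]
p_q = [contrast(QUAD, col) for col in columns]
right = {
    "NlPl": contrast(LIN, p_l), "NlPq": contrast(LIN, p_q),
    "NqPl": contrast(QUAD, p_l), "NqPq": contrast(QUAD, p_q),
}

print(n_l, n_q, sum(n_l), sum(n_q))
print(p_l, p_q, sum(p_l), sum(p_q))
print(left, right)
```

The four interaction components agree whichever half is used.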

The squares of these quantities, with appropriate divisors, will give an
analysis of variance of the 8 d.f. amongst the treatment totals into 8
single components. Since an individual entry in table 5.6 is the total
of 12 plots, the divisors may be verified to be as shown below.

             N_l or P_l    N_q or P_q    N_lP_l    N_lP_q    N_qP_l    N_qP_q

Divisor          72           216          48        144       144       432

For instance, the divisor for $N_qP_q$ may be worked out as follows. The
three $N_q$ figures in table 5.7 (i.e., −51, −16, +97) each have divisor
$12(1^2 + 2^2 + 1^2)$, or 72. Since the $N_qP_q$ total is a linear function of
these three figures, with coefficients 1, −2, and 1, this total has divisor
$72(1^2 + 2^2 + 1^2)$, or 432. The analysis of variance is shown in table 5.8.

TABLE 5.8 Subdivision of the treatment s.s.

             d.f.      s.s. or m.s.

N_l            1         1012.50
N_q            1            4.17
P_l            1          917.35
P_q            1            0.04
N_lP_l         1          184.08
N_lP_q         1           49.00
N_qP_l         1          152.11
N_qP_q         1           14.08


The error m.s. in this experiment is about 59. The linear effects of both
fertilizers are significant, with no indication of curvature. The $N_lP_l$ effect
is significant at the 10% level, but not at the 5% level.
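
The divisors and the entries of table 5.8 can be obtained by one general rule. In the sketch below each component is treated as a linear function of the 9 treatment totals whose coefficients are products of a multiplier for N and a multiplier for P, a main effect using the multipliers (1, 1, 1) for the other factor. This formulation is an assumption made for the illustration; it reproduces the divisors given in the text.

```python
from fractions import Fraction

cells = [[449, 413, 326],
         [409, 358, 291],
         [341, 278, 312]]  # table 5.6: rows p0..p2, columns n0..n2
PLOTS_PER_TOTAL = 12
MULTIPLIERS = {"": (1, 1, 1), "l": (-1, 0, 1), "q": (1, -2, 1)}


def component(n_part, p_part):
    """Return (effect total, divisor, sum of squares) for one single component."""
    cn, cp = MULTIPLIERS[n_part], MULTIPLIERS[p_part]
    total = sum(cp[i] * cn[j] * cells[i][j] for i in range(3) for j in range(3))
    divisor = PLOTS_PER_TOTAL * sum(c * c for c in cn) * sum(c * c for c in cp)
    return total, divisor, Fraction(total ** 2, divisor)


table = {}
for n_part in ("", "l", "q"):
    for p_part in ("", "l", "q"):
        if n_part or p_part:  # skip the general total
            name = ("N" + n_part if n_part else "") + ("P" + p_part if p_part else "")
            table[name] = component(n_part, p_part)

for name, (total, divisor, ss) in table.items():
    print(f"{name:5s} {total:5d} {divisor:4d} {float(ss):9.2f}")

# The 8 components should add to the s.s. among the 9 treatment totals.
grand = sum(sum(row) for row in cells)
ss_treatments = Fraction(sum(t * t for row in cells for t in row), PLOTS_PER_TOTAL) \
    - Fraction(grand ** 2, 9 * PLOTS_PER_TOTAL)
print(float(ss_treatments))
```

The four interaction components add to 399.28 when rounded to two decimals.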

In table 5.6, latin and greek letters were superimposed so as to
form a 3 × 3 graeco-latin square. This square leads to another method
of calculating the sum of squares for the interactions (4 d.f.). Although
the method is not likely to be of use for purposes of interpretation, it has
formed the basis of some ingenious devices in the construction of designs.
In the square, the column totals represent the main effects of N
and the row totals those of P. Since the latin letter totals are orthogonal
to rows and columns, it seems reasonable to suppose that they must
represent two of the 4 components of the NP interaction. Similarly,
the greek letter totals represent the remaining 2 components. These
totals are shown below.

     A       B       C     Total          α       β       γ     Total

   1119    1013    1045    3177         1018    1134    1025    3177




Each figure is now a total of 36 plots. The sum of squares of deviations 
of the latin letter totals, divided by 36, is 164.22, and the corresponding
figure for the greek letters is 235.06. These add to 399.28, which is the 
same as the total s.s. for the interactions in table 5.8, apart from round- 
ing differences. 
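
The letter totals and their sums of squares may be obtained from table 5.6 as in the following sketch. The greek letters are spelled out as alpha, beta, gamma, and the layout of the data as (total, latin letter, greek letter) is a choice made for the illustration.

```python
from fractions import Fraction

# Table 5.6 with the superimposed letters: (total, latin, greek).
square = [
    [(449, "A", "alpha"), (413, "C", "beta"), (326, "B", "gamma")],
    [(409, "B", "beta"), (358, "A", "gamma"), (291, "C", "alpha")],
    [(341, "C", "gamma"), (278, "B", "alpha"), (312, "A", "beta")],
]
PLOTS_PER_LETTER_TOTAL = 36  # 3 cells of 12 plots each


def letter_totals(position):
    """Totals over the cells carrying each letter (position 1 latin, 2 greek)."""
    totals = {}
    for row in square:
        for cell in row:
            totals[cell[position]] = totals.get(cell[position], 0) + cell[0]
    return totals


def ss_of_totals(totals):
    """Sum of squares of deviations of the totals, on a single-plot basis."""
    mean = Fraction(sum(totals.values()), len(totals))
    return sum((t - mean) ** 2 for t in totals.values()) / PLOTS_PER_LETTER_TOTAL


latin = letter_totals(1)
greek = letter_totals(2)
ss_latin = ss_of_totals(latin)
ss_greek = ss_of_totals(greek)
print(latin, round(float(ss_latin), 2))
print(greek, round(float(ss_greek), 2))
print(round(float(ss_latin + ss_greek), 2))
```

The sum, 399.28, agrees with the total of the four single components of interaction computed from the effect totals of table 5.7 with the divisors 48, 144, 144, and 432.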

5.27 General Method of Analysis. Suppose that there are three factors,
A, B, C, which occur at $\alpha$, $\beta$, and $\gamma$ levels respectively. The main
effects have $(\alpha - 1)$, $(\beta - 1)$, and $(\gamma - 1)$ components or degrees of
freedom respectively. Each component of the main effect of A can be estimated
separately at each of the $\beta$ levels of B. Thus each component of A
contributes $(\beta - 1)$ degrees of freedom to the AB interaction. This means
that the AB interaction contains a total of $(\alpha - 1)(\beta - 1)$ components
or degrees of freedom. Similarly the AC interaction has $(\alpha - 1)(\gamma - 1)$
degrees of freedom, and the BC interaction has $(\beta - 1)(\gamma - 1)$ degrees
of freedom.

To compute the sums of squares for the main effects and first-order
interactions, we form two-way tables for each pair of factors. Consider
the A by B two-way table. The total s.s. among cells has $(\alpha\beta - 1)$ degrees
of freedom. From the marginal totals in the table we compute
the sum of squares for the main effect of A, with $(\alpha - 1)$ degrees of freedom,
and that for the main effect of B, with $(\beta - 1)$ degrees of freedom.
By subtraction, the sum of squares for the AB interaction, with
$(\alpha - 1)(\beta - 1)$ degrees of freedom, is obtained.

There remains the three-factor, or ABC, interaction. Now each of
the $(\alpha - 1)(\beta - 1)$ components in the AB interaction is estimated separately
at each level of C. It will therefore contribute $(\gamma - 1)$ degrees
of freedom to the ABC interaction, so that the latter contains in all
$(\alpha - 1)(\beta - 1)(\gamma - 1)$ degrees of freedom. The sum of squares for ABC
is also obtained most easily by subtraction. From the total s.s. amongst
treatments, with $(\alpha\beta\gamma - 1)$ degrees of freedom, subtract the sums of
squares for A, B, C, AB, AC, and BC. The remainder will be the sum of
squares for ABC.
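
The rule for degrees of freedom extends to any number of factors and may be tabulated mechanically. A minimal sketch follows; the function name and the example of a 4 × 3 × 2 factorial are illustrative.

```python
from itertools import combinations
from math import prod


def degrees_of_freedom(levels):
    """Degrees of freedom of every main effect and interaction.

    `levels` maps each factor to its number of levels, e.g. {"A": 4, "B": 3}.
    """
    names = sorted(levels)
    df = {}
    for size in range(1, len(names) + 1):
        for subset in combinations(names, size):
            # An interaction has the product of (levels - 1) over its factors.
            df["".join(subset)] = prod(levels[f] - 1 for f in subset)
    return df


levels = {"A": 4, "B": 3, "C": 2}
df = degrees_of_freedom(levels)
print(df, sum(df.values()))
```

The degrees of freedom of all effects add to one fewer than the number of treatment combinations, here 24 − 1 = 23, which provides a check on the subdivision.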

It is hoped that the reader will find no difficulty in extending these
methods to the case where there are more than three factors. In general,
the sums of squares for main effects are calculated directly, and those
for interactions are calculated by subtraction. To compute an ABCD
interaction, for instance, we require a four-way table for the four factors
represented. From the sum of squares for this table we subtract the sum
of squares for all main effects and two- and three-factor interactions
among the factors in question. Calculations must be checked by recomputation.
Of course, if the experimenter subdivides any interaction
into single components as in the previous section, a check is provided by
this process.

In the general case it remains true that any component of a factorial
effect is orthogonal to any component of any other factorial effect. Thus
any component of the BC interaction is orthogonal to any component of
the A main effect, or of the BCD interaction, etc. Two components of
the same factorial effect may or may not be independent. For instance,
if A occurs at three levels, the comparison (a₂ − a₀) is independent of the
comparison (a₂ − 2a₁ + a₀), but is not independent of (a₁ − a₀).
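Independence of two comparisons among equally replicated totals can be checked by multiplying their coefficients term by term and summing; a zero sum means the comparisons are orthogonal. The short sketch below applies this to the three comparisons just mentioned. The variable names are chosen for the illustration only.

```python
def dot(u, v):
    """Sum of products of corresponding coefficients."""
    return sum(a * b for a, b in zip(u, v))


# Coefficients applied to the level totals (a0, a1, a2).
linear = (-1, 0, 1)       # a2 - a0
quadratic = (1, -2, 1)    # a2 - 2*a1 + a0
first_step = (-1, 1, 0)   # a1 - a0

lin_vs_quad = dot(linear, quadratic)    # zero: orthogonal
lin_vs_step = dot(linear, first_step)   # non-zero: not orthogonal
```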

The following selected references contain a discussion of factorial ex- 
periments with some worked examples. 

5.3 Yates, F. The design and analysis of factorial experiments. Imp. Bur. Soil
Sci. Tech. Comm. 35, 1937. This gives the most comprehensive account that
is available, with numerous worked examples.

5.4 Yates, F. Complex experiments. Jour. Roy. Stat. Soc. Suppl. 2, 181-247,
1935. An earlier reference, with examples of 2², 2³, 4 × 3, and 3³ factorials.

5.5 Tippett, L. H. C. The methods of statistics. Williams and Norgate, London,
2nd ed., 1937. The use of factorial designs in industrial experimentation is
indicated. A worked example is given of the breakdown of the treatment s.s.
in a 5 × 4 × 3 × 2 factorial in single replication.

5.6 Lindquist, E. F. Statistical analysis in educational research. Houghton
Mifflin, Boston, 1940. The application of factorial designs in experiments on
methods of teaching is described, with a worked analysis of variance for a
4 × 3 design completely randomized.

5.7 Bliss, C. I. Factorial design and covariance in the biological assay of vitamin
D. Jour. Amer. Stat. Assoc. 35, 498-506, 1940. Worked example of a 3 × 2
design.


5.28 Interpretation of the Analysis: First Example. The separation 
of the treatment comparisons into main effects and interactions is a con- 
venient and powerful method of analysis in cases where interactions are 
small relative to main effects. When interactions are large, this analysis 
must be supplemented by a detailed examination of the nature of the 
interactions. It may, in fact, be found that an analysis into main effects 
and interactions is not suited to the data at hand. There is sometimes a 
tendency to apply the factorial method of analysis mechanically without 
considering whether it is suitable or not, and also a tendency to rely too 
much on the initial analysis of variance alone when writing a summary of 
the results. Below two examples are presented where the initial analysis 
of variance is not very informative, and where the results can be sum- 
marized better in terms of simple rather than factorial effects. 

The first experiment was conducted by the Wailuku Sugar Company.
Three varieties of sugar cane were compared, in combination with three
levels of nitrogen (150, 210, and 270 lb. N per acre respectively). The
crop was the second harvesting, or the first ratoon crop. In table 5.9
only the relevant part of the analysis of variance is shown: i.e., the sums
of squares for the main effects of V and N and for the VN interaction,
plus the error s.s. The data are in tons of cane per acre. The conclusions
from the analysis of variance are that the main effects of V and the VN
interaction are both significant, but there is no sign of a main effect due
to N. This statement tells little about the results of the experiment.

TABLE 5.9 Analysis of variance of a 3 × 3 sugar cane experiment

            d.f.        s.s.        m.s.
V             2       319.38      159.69
N             2        56.54       28.27
VN            4       559.79      139.95
Error        24      1053.84       43.91

The treatment totals (over 4 replications) are shown below with their
standard errors. Since the standard error per plot is √43.91 = 6.626,
the standard error for a treatment total is 6.626 × √4, or 13.3, as shown.

TABLE 5.10 Treatment totals (tons) in a 3 × 3 sugar cane experiment
(±13.3)

              n₀        n₁        n₂       Total      s.e.
v₁          266.1     275.9     303.8      845.8
v₂          245.8     250.2     281.7      777.7     ±23.0
v₃          274.4     258.1     231.6      764.1

Total       786.3     784.2     817.1     2387.6
s.e.                  ±23.0

Instead of having no effect, nitrogen has apparently given a steady
increase in yields with the first two varieties, but a steady decrease with the
third variety. Further, the significant main effects of varieties apply
only to the average varietal yields over all 3 dressings of N, and not to
yields with a particular rate of dressing. On the average v₁ gives a
substantially higher yield than v₃, but at the lowest level of N, v₃ is slightly
above v₁.

The subsequent analysis may be made either by means of t-tests
applied to table 5.10 or by means of a further subdivision of the analysis of
variance. It is of interest to examine the response to N for each variety
separately. To test the linear responses by means of a t-test, we require
the standard error for (n₂ − n₀) as computed for each variety. This
standard error is 13.3 × 1.414, or 18.8, and since the 5% t-value for 24
d.f. is 2.064, the quantities (n₂ − n₀) must attain the value 18.8 × 2.064,
or 38.8, in order to be significant at the 5% level. The actual values are

(n₂ − n₀)v₁ = +37.7; (n₂ − n₀)v₂ = +35.9; (n₂ − n₀)v₃ = −42.8

Thus neither of the increases with v₁ and v₂ quite reaches the significance
level, though both are close to it. The decrease with v₃ is significant.

The test of linearity of the response curves is made by means of the
quantity (n₂ + n₀ − 2n₁), calculated for each variety. The standard
error of this quantity is 13.3 × √6, or 32.6. The reader may verify
from table 5.10 that none of the curvature terms even exceeds its
standard error, so that there is no occasion to reject the hypothesis that the
responses are linear.
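The linear and curvature tests described above can be retraced with the following sketch. The totals and the error mean square are those of tables 5.9 and 5.10, and the t-value 2.064 is the one quoted in the text; the variable names are illustrative. Because the standard errors are carried here without intermediate rounding, the threshold comes out near 38.7 rather than the 38.8 obtained from the rounded figure 18.8.

```python
import math

error_ms = 43.91   # error mean square per plot (table 5.9)
reps = 4           # plots contributing to each treatment total
t_05 = 2.064       # 5% t-value for 24 d.f., as quoted in the text

# Treatment totals (n0, n1, n2) for each variety, from table 5.10.
totals = {
    "v1": (266.1, 275.9, 303.8),
    "v2": (245.8, 250.2, 281.7),
    "v3": (274.4, 258.1, 231.6),
}

se_total = math.sqrt(error_ms * reps)   # s.e. of one treatment total
se_linear = se_total * math.sqrt(2)     # s.e. of n2 - n0
se_curv = se_total * math.sqrt(6)       # s.e. of n2 + n0 - 2*n1
threshold = se_linear * t_05            # value needed at the 5% level

linear = {v: n2 - n0 for v, (n0, n1, n2) in totals.items()}
curvature = {v: n2 + n0 - 2 * n1 for v, (n0, n1, n2) in totals.items()}
significant = {v: abs(d) > threshold for v, d in linear.items()}
```

Only the linear response of the third variety exceeds the threshold, and no curvature term exceeds its standard error, in agreement with the text.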

It is not quite so clear what tests are appropriate for appraising the
varietal differences. Since, however, the difference between v₁ and v₂ is
very consistent at all levels of N, a t-test of the total difference is
suggested. This difference is 68.1, and since the value required for
significance is 23.0 × 1.414 × 2.064, or 67.1, the superiority of v₁ over v₂ is
just significant. Further, it is evident on inspection that v₃ does not
differ significantly from the other varieties at either of the two lower levels
of N. At the highest level, v₃ is significantly below both v₁ and v₂.

When interactions are large, much care is required in the preparation 
of a statement that summarizes the results, and it is not easy to reach a 
form that is free from criticism. The following is a suggestion for this 
example. 

“The increase in yield of cane to the highest dressing of N (270 lb. per
acre) over the lowest dressing (150 lb. per acre) was 9.4 tons per acre with
v₁ and 9.0 tons per acre with v₂. Both increases just failed to be
significant at the 5% level. With v₃, on the other hand, the highest dressing
of N decreased the yield significantly by 10.7 tons per acre as compared
with the lowest dressing. For all three varieties the effects of N appeared
to be proportional to the amount applied, within the range investigated
in this experiment.

“Variety 1 gave a higher yield than variety 2 for all levels of N, the
average difference, 5.7 tons, being just significant at the 5% level. The
yields for variety 3 did not differ significantly from those of the other
varieties at the two lower levels of N. At the highest level of N, the yield
for variety 3 was significantly lower than that for either of the other
varieties.”

5.29 Interpretation of the Analysis: Second Example. This example
is more complex, mainly because the factorial (a 3 × 3) is of an unusual
type. The data come from a long-term experiment on meadow hay,
conducted at Lady Manner's School, Bakewell, England, with the
cooperation of Rothamsted Experimental Station. The yields are for the
1937 season. The two factors are shown schematically below.

                               No manure
First factor (3 levels)        Mixed artificials       Applied in 1936
                               8 tons compost

                               No manure
Second factor (3 levels)       Mixed artificials       Applied in 1937
                               8 tons compost

The nature of the experiment may become clear from a discussion of some 
of the individual treatment combinations. In any replicate there are 
nine plots, of which three received mixed artificial fertilizers in 1937. 
In the previous year, 1936, one of these three plots received no manure, 
one received mixed artificials, and one received compost. Thus these 
three plots enable us to compare the residual effects of the 1936 applications
of artificials and compost, on plots which received artificials in
the current year, 1937. Similarly, from other treatment combinations, 
we can compare the residual effects of artificials and compost on plots 
which were unmanured in the current year and on plots which received 
compost in the current year. 

Further, in any replicate there are three plots which were unmanured 
in 1936. Of these, one had no manure, one had artificials, and one had 
compost in 1937. Consequently we may also assess the direct effects of 
artificials and compost applied in 1937 on plots which were unmanured 
in the previous year, and likewise on plots which received artificials or 
compost in the previous year. Both the artificials and compost contain 
the three common plant nutrients, nitrogen, phosphorus, and potash. 

The system of treatments is an ingenious one, designed to measure 
direct and residual effects at the same time. The experiment was started 
in 1932, but for simplicity we will ignore any effects of treatments applied
prior to 1936. 

The treatment means and the analysis of variance are shown in tables
5.11 and 5.12. The analysis of variance was calculated from the treatment
means, rather than from single plots, and the error m.s. has been
adjusted so as to apply to a treatment mean. The direct effects are
highly significant, the interactions are significant at the 5% level, but
residual effects are not significant. In considering the nature of the
effects leading to these results, it is again convenient to think in terms of
simple effects rather than of main effects and interactions. It is worth
noting that the standard error of a single entry in the two-way table is
√6.656, or 2.58, while that for the difference between two entries is
3.65.

With regard to residual effects, table 5.11 shows that compost
produced a large and highly significant increase, 13.4 cwt. ± 3.65, on the
plots that received no manure in the current year. On plots that
received manure in 1937, either artificials or compost, there is no suggestion
of any residual effect of compost. None of the residual effects of
artificials approaches the level required for significance. These results are
in accord with general fertilizer experience, since a compost is more
likely to give residual effects than an inorganic fertilizer, and since
residual effects would be expected to show up most clearly on plots which
have no manures during the current year.

TABLE 5.11 Treatment mean yields (hay, cwt. per acre) (±2.58)

Direct effects of           Residual effects of 1936 treatments
1937 treatments         None     Artificials     Compost      Mean
None                    53.6        56.8           67.0       59.1
Artificials             80.8        82.3           80.5       81.2
Compost                 74.3        69.1           70.0       71.1

Mean                    69.6        69.4           72.5       70.5

TABLE 5.12 Analysis of variance of treatment means

                      d.f.       s.s.        m.s.
Direct effects          2       732.28      366.14**
Residual effects        2        18.24        9.12
Interaction             4        97.01       24.25*
Error                  24       159.74        6.656

So far as direct effects are concerned, artificials were superior to com- 
post whatever the residual treatment. 


1936 manuring          1937 artificials − 1937 compost (cwt.)
No manure                           +6.5
Artificials                        +13.2
Compost                            +10.5

The differences among these three figures are within the limits of
experimental error. To test this point, we find the sum of squares of
deviations of these differences, which comes to 22.73. This is divided by
2 to make it comparable with the analysis of variance, giving 11.36,
which represents two of the 4 d.f. for interactions. The mean square,
5.68, is slightly below the error m.s. Thus the superiority of artificials
appears to be consistent.

The increases for artificials and compost over no manure are much
reduced on plots that received compost in the previous year.

1936 manuring                     (1937 artificials −      (1937 compost −
                                      1937 none)              1937 none)
Compost                                 +13.5                    +3.0
Artificials and none (averaged)         +26.4                   +16.5

The striking reduction on plots with 1936 compost is simply a reflection
of the nature of the residual effect of compost as previously examined.

The conclusions from the experiment might be summarized as follows. 

“Artificials applied in 1937 increased yields by 13.5 cwt. on plots that 
received compost in the previous year, and by 26.4 cwt. on plots that did 
not receive compost in the previous year. The corresponding increases 
due to 1937 compost were 3.0 cwt. and 16.5 cwt., respectively. The su- 
periority of 1937 artificials over 1937 compost, which averaged 10.1 
cwt., appeared to be independent of the type of manuring during the 
previous year. 

“As regards residual effects, compost applied in 1936 increased yields
by 13.4 cwt. on plots that were unmanured in 1937, but gave no apparent
increase on plots manured in 1937. There were no significant residual
effects of 1936 artificials.”

It seems evident that the significant interaction m.s. in the analysis
of variance must arise mainly from the fact that compost had a residual
effect only when no manure was applied currently. As an exercise it may
be instructive to isolate the part of the interaction s.s. that is due to this
effect. Since artificials appeared to have no residual effect, we will
combine the 1936 artificials with the 1936 unmanured plots. Consequently,
the residual effects of compost are obtained from the comparison

2(1936 compost) − (1936 artificials) − (1936 none)

This quantity may be calculated separately for each of the 1937
treatments. Its values are

(1937 none) = +23.6; (1937 artificials) = −2.1; (1937 compost) = −3.4

The contrast between the residual effect of compost on plots without
1937 manures and that on plots with 1937 manures may be estimated
from the comparison

2(+23.6) − (−2.1) − (−3.4) = +52.7

The square of this quantity, with a suitable divisor, is a single
component of the interaction s.s. Since the divisor for the quantities 23.6,
etc., is 6, the divisor needed is 6 × 6, or 36. Hence the sum of squares
for this component is (52.7)²/36, or 77.15. The remaining 3 d.f. for
interactions have a sum of squares equal to 19.86. The mean square,
6.62, is no larger than the error m.s. This verifies the suggestion that
the significance of the interaction m.s. can be attributed to the type of
residual effect of compost.

Some readers may have difficulty in satisfying themselves that the 
component isolated above really is a part of the interaction s.s. It may 
be helpful to express the basic quantity, 52.7, as a linear function of the 
original mean yields in table 5.11. The multipliers of the means will be 
found to be as shown below. 


                              1936 treatments
1937 treatments        None     Artificials     Compost
None                    −2          −2             4
Artificials              1           1            −2
Compost                  1           1            −2


The sums of the coefficients are zero over every row and column so that 
the expression is orthogonal to both sets of main effects. Hence it must 
be a part of the interaction. The coefficients also enable us to verify the 
divisor, 36, which is equal to the sum of the squares of the coefficients. 
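The verification just described is easy to retrace numerically. In the sketch below the means are those of table 5.11 and the multipliers are those shown above, with rows for the 1937 treatments and columns for the 1936 treatments; the variable names are illustrative, and 97.01 is the interaction s.s. of table 5.12.

```python
# Rows: 1937 treatment (none, artificials, compost).
# Columns: 1936 treatment (none, artificials, compost).
means = [
    [53.6, 56.8, 67.0],
    [80.8, 82.3, 80.5],
    [74.3, 69.1, 70.0],
]
coef = [
    [-2, -2, 4],
    [1, 1, -2],
    [1, 1, -2],
]

# Zero row and column sums show orthogonality to both sets of main effects.
row_sums = [sum(r) for r in coef]
col_sums = [sum(c) for c in zip(*coef)]

# The divisor is the sum of squares of the coefficients.
divisor = sum(c * c for r in coef for c in r)

# Value of the comparison and its contribution to the interaction s.s.
contrast = sum(c * m for rc, rm in zip(coef, means) for c, m in zip(rc, rm))
component_ss = contrast ** 2 / divisor
remainder_ss = 97.01 - component_ss   # left for the other 3 d.f.
```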

Since the interactions can be attributed to the behavior of a single
group of plots (those with compost in 1936 and no manuring in 1937), it
might be suspected that this treatment had been allotted by chance a
favorable set of plots. The residual increase, 13.4 cwt., does seem rather
large in relation to the direct effect of compost. An examination of
previous results does not lend much weight to this suspicion. Over the
4 preceding years, the average residual response to compost, with no
current manure, was 11.1 cwt. In 2 years, 1935 and 1933, the plots
involved in this comparison were the same as those in 1937. For these
years the average residual effect was 6.2 cwt. In the other 2 years, when
a different set of compost plots is involved, the average residual effect
was 16.0 cwt.

The two preceding examples are intended to illustrate the fact that
the most informative subdivision of the treatment comparisons depends
on the type of experiment. An analysis copied from a model that
appears similar in form may be inappropriate or even meaningless. The
experimenter should first decide which comparisons are necessary for the
interpretation of the results. The subsequent analysis, either by t-tests
or by subdivision of the analysis of variance, should be directed towards
these comparisons.

5.3 Designs for Factorial Experiments

5.31 Factorials in Complete Block Designs. Most types of experimental
plan are suitable for factorial experiments. In particular, if the
total number of treatment combinations is not large, the designs
described in chapter 4 are frequently used. The relative advantages of
complete randomization, randomized blocks, and latin squares are the
same with factorial as with non-factorial sets of treatments. For
illustration, we show arrangements for a 4 × 2 factorial (i) in 8 randomized
blocks and (ii) in an 8 × 8 latin square. In the former case, only
the first replicate is given.

TABLE 5.13 Factorial experiment arranged in (i) randomized blocks
and (ii) a latin square

8 replications, 8 treatment combinations
A at 4 levels (1, 2, 3, 4)
B at 2 levels (1, 2)

i. In randomized blocks

Rep. I:   12   31   21   32   11   22   41   42

Analysis of variance
Replications                    7
Treatment combinations          7
    A                           3
    B                           1
    AB                          3
Error                          49
Total                          63

ii. In a latin square

42   11   22   12   31   41   32   21
21   32   31   41   22   12   11   42
12   41   42   11   32   21   22   31
31   22   21   42   11   32   41   12
32   12   11   31   21   22   42   41
41   42   32   22   12   31   21   11
11   21   12   32   41   42   31   22
22   31   41   21   42   11   12   32

Analysis of variance
Rows                            7
Columns                         7
Treatment combinations          7
    A                           3
    B                           1
    AB                          3
Error                          42
Total                          63

In the next section, a worked example of the analysis of a large
factorial experiment in randomized blocks is presented.

TABLE 5.14 Weights of denervated (y) and corresponding normal (x)
muscle (unit = 0.01 gram)

                                 Number of treatment periods daily
Length of
treatment    Type of         One (a₁)       Three (a₃)       Six (a₆)
(minutes)    current         y     x        y     x          y     x

Rep. I
1 (b₁)       Galvanic       72   152       74   131         69   131
             Faradic        61   130       61   129         65   126
             60 cycle       62   141       65   112         70   111
             25 cycle       85   147       76   125         61   130
2 (b₂)       Galvanic       67   136       52   110         62   122
             Faradic        60   111       55   180         59   122
             60 cycle       64   126       65   190         64    98
             25 cycle       67   123       72   117         60    92
3 (b₃)       Galvanic       57   120       66   132         72   129
             Faradic        72   165       43    95         43    97
             60 cycle       63   112       66   130         72   180
             25 cycle       56   125       75   130         92   162
5 (b₄)       Galvanic       57   121       56   160         78   135
             Faradic        60    87       63   115         58   118
             60 cycle       61    93       79   126         68   160
             25 cycle       73   108       86   140         71   120

Rep. II
1 (b₁)       Galvanic       46    97       74   131         58    81
             Faradic        60   126       64   124         52   102
             60 cycle       71   129       64   117         71   108
             25 cycle       53   108       65   108         66   108
2 (b₂)       Galvanic       44    83       58   117         54    97
             Faradic        57   104       55   112         51   100
             60 cycle       62   114       61   100         79   115
             25 cycle       60   105       78   112         82   102
3 (b₃)       Galvanic       53   101       50   103         61   115
             Faradic        56   120       57   110         56   105
             60 cycle       56   101       56   109         71   105
             25 cycle       56    97       58    87         69   107
5 (b₄)       Galvanic       46   107       55   108         64   115
             Faradic        56   109       55   104         57   103
             60 cycle       64   114       66   101         62    99
             25 cycle       59   102       58    98         88   135

5.32 Numerical Example: a 4 × 4 × 3 Factorial in Randomized
Blocks. A number of experiments have indicated that electrical
stimulation may be helpful in preventing the wasting away of muscles that are
denervated. A factorial experiment on rats was conducted by Solandt,
DeLury, and Hunter (5.8) in order to learn something about the best
type of current and the most effective method of treatment. The
factors and their levels are shown below.


         A                         B                        C
Number of treatment       Length of treatment        Type of current
   periods daily              (minutes)

         1                         1                 Galvanic
         3                         2                 Faradic
         6                         3                 60 cycle alternating
                                   5                 25 cycle alternating


Treatments were started on the third day after denervation and con-
tinued for 11 consecutive days. There are 48 different combinations of 
methods of treatment, each of which was applied to a different rat. Two 
replications were conducted, using 96 rats in all. 

The muscles denervated were the gastrocnemius-soleus group on one
side of the animal, denervation being accomplished by the removal of a 
small part of the sciatic nerve. The measure used for judging the ef- 
fects of the treatments was the weight of the denervated muscle at the 
end of the experiment. Since this depends on the size of the animal, the 
weight of the corresponding muscle on the other side of the body was in- 
cluded as a covariance variate. The data are shown in table 5.14. 

For a covariance analysis we require analyses of variance for both the 
denervated muscle (y) and the normal muscle (x), and the analysis of the
product (yx). From the discussion in section 5.27 and preceding sections,
the reader should have little difficulty in completing the analysis.
The replicate totals, treatment-combination totals, and the grand total 
are first computed. From the treatment totals, the three two-way tables 
are constructed, as given in table 5.15. Since each set of main effect 
totals is obtained twice, this part is self-checking (though only one set of 
main effect totals need be recorded). To illustrate the computations 
made on the A X B table, we have for the analysis of y 

Total s.s. for A × B table

$$= \tfrac{1}{8}\left[(510)^2 + (543)^2 + \cdots + (546)^2\right] - \tfrac{1}{96}(6069)^2 = 1039$$

$$A = \tfrac{1}{32}\left[(1936)^2 + (2028)^2 + (2105)^2\right] - \tfrac{1}{96}(6069)^2 = 447$$

$$B = \tfrac{1}{24}\left[(1565)^2 + (1488)^2 + (1476)^2 + (1540)^2\right] - \tfrac{1}{96}(6069)^2 = 223$$

By subtraction, the AB sum of squares is found to be 369. For the sum
of products, we replace each y² by the corresponding yx product. Thus

$$\text{Total s.p.} = \tfrac{1}{8}\left[(510)(1030) + (543)(977) + \cdots + (546)(985)\right] - \tfrac{1}{96}(6069)(11{,}307) = 1151$$

Finally, the ABC sum of squares is found by calculating the total s.s.
among all 48 treatment combinations, and subtracting the sums of squares
for A, B, C, AB, AC, and BC.
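The computations on the A × B table can be retraced with the sketch below, which uses the y totals of that table (each cell a total over 8 rats). The helper name `sum_sq` is an assumption made for the illustration. Carried at full precision, the subtraction gives an AB sum of squares close to 368; the figure 369 in the text results from subtracting the rounded values 1039, 447, and 223.

```python
# y totals for the A x B table: rows a1, a3, a6; columns b1..b4.
ab = [
    [510, 481, 469, 476],
    [543, 496, 471, 518],
    [512, 511, 536, 546],
]
per_cell = 8  # rats contributing to each cell total


def sum_sq(totals, n_per_total, correction):
    """Sum of squared totals over their divisor, less the correction term."""
    return sum(t * t for t in totals) / n_per_total - correction


grand = sum(map(sum, ab))
n_total = per_cell * len(ab) * len(ab[0])
cf = grand ** 2 / n_total                      # correction for the mean

cells = [t for row in ab for t in row]
a_totals = [sum(row) for row in ab]            # each over 32 rats
b_totals = [sum(col) for col in zip(*ab)]      # each over 24 rats

ss_table = sum_sq(cells, per_cell, cf)
ss_a = sum_sq(a_totals, per_cell * len(ab[0]), cf)
ss_b = sum_sq(b_totals, per_cell * len(ab), cf)
ss_ab = ss_table - ss_a - ss_b                 # interaction by subtraction
```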

TABLE 5.15 Two-way tables in a 4 × 4 × 3 factorial

A × B table
                            B: Length of treatment
                     b₁         b₂         b₃         b₄
y     a₁            510        481        469        476
      a₃            543        496        471        518
      a₆            512        511        536        546
      Total       1,565      1,488      1,476      1,540

x     a₁          1,030        902        941        841
      a₃            977      1,038        896        952
      a₆            897        848      1,000        985
      Total       2,904      2,788      2,837      2,778

A × C table
C: Type of                     y                                 x
current            a₁      a₃      a₆    Total       a₁      a₃      a₆    Total
G                 442     485     518    1,445      917     992     925    2,834
F                 482     453     441    1,376      952     969     873    2,794
60                503     522     557    1,582      930     985     976    2,891
25                509     568     589    1,666      915     917     956    2,788

A totals        1,936   2,028   2,105    6,069    3,714   3,863   3,730   11,307

B × C table
C: Type of                   y                              x
current            b₁     b₂     b₃     b₄        b₁     b₂     b₃     b₄
G                 393    337    359    356       723    665    700    746
F                 363    337    327    349       737    729    692    636
60                403    395    384    400       718    743    737    693
25                406    419    406    435       726    651    708    703

The analyses of y², yx, and x² are given in table 5.16. The
regression coefficient of weight of denervated on weight of normal muscle is
3977/16,013, or 0.248361. The next step is to form the analysis of
(y − bx)². This is done by adding to any y² value (−2b) times the
corresponding yx value, plus (b²) times the corresponding x² value. The
mean squares for (y − bx) are the quantities used to test the significance
of treatment effects on the weights of denervated muscles, adjusted for
their regression on the weights of the normal muscles. As shown in
section 3.86, the treatment m.s. is slightly inflated because of the
sampling error of the regression coefficient. The inflation is sufficiently
small, however, that only effects which appear to be on the borderline of
significance need be tested by the exact method (section 3.86), which is
more laborious to calculate.

In the analysis of variance (table 5.16) only two factorial effects are
significant: those for the main effects of C (type of current used) and for
the main effects of A (number of treatment periods daily). The AC
interaction m.s. is somewhat higher than the error m.s., but does not
approach the 5% significance level. These results suggest that in preparing
summary tables for further examination and for presentation of the
results, we probably need only the AC two-way table. The averages
for B (length of an individual treatment) should also be considered in
case they indicate a trend effect that was not marked enough to attain
significance. These data appear in table 5.17.

TABLE 5.16 Analyses of variance

                                       y²        yx        x²         (y − bx)²
                              d.f.    s.s.      s.p.      s.s.      s.s.      m.s.
Replications                    1      605     2,503    10,354         0
A (number of treatments)        2      447        64       418       441     220.5
B (length of period)            3      223       137       415       181      60.4
C (type of current)             3    2,145       104       281     2,111     703.7
AB                              6      369       950     5,202       218      36.3
AC                              6      645       476     1,014       471      78.5
BC                              9      299        11     2,015       418      46.4
ABC                            18    1,050       666     5,198     1,040      57.8
Error                          47¹   3,199     3,977    16,013     2,211      48.1

Total                          95²   8,982     8,888    40,910     7,091

¹ 46 d.f. for (y − bx)².
² 94 d.f. for (y − bx)².

Note. In reference (5.8), the sums of squares for AC and ABC are in error.
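The construction of the (y − bx)² column of table 5.16 can be retraced with the following sketch. Each line of the table is entered as (d.f., s.s. of y, s.p. of yx, s.s. of x); the function name `adjusted_ss` is an assumption made for the illustration. The regression coefficient is taken from the error line, and the error loses one degree of freedom to the fitted regression.

```python
# (d.f., s.s. of y, s.p. of yx, s.s. of x) for each line of table 5.16.
rows = {
    "Replications": (1, 605, 2503, 10354),
    "A": (2, 447, 64, 418),
    "B": (3, 223, 137, 415),
    "C": (3, 2145, 104, 281),
    "AB": (6, 369, 950, 5202),
    "AC": (6, 645, 476, 1014),
    "BC": (9, 299, 11, 2015),
    "ABC": (18, 1050, 666, 5198),
    "Error": (47, 3199, 3977, 16013),
}

# Regression coefficient from the error line: s.p. / s.s. of x.
_, _, error_yx, error_xx = rows["Error"]
b = error_yx / error_xx


def adjusted_ss(yy, yx, xx, slope):
    """Sum of squares of (y - slope*x): yy - 2*slope*yx + slope^2 * xx."""
    return yy - 2 * slope * yx + slope ** 2 * xx


adj = {name: adjusted_ss(yy, yx, xx, b)
       for name, (_, yy, yx, xx) in rows.items()}

# One error d.f. is absorbed by the regression, leaving 46.
error_ms = adj["Error"] / (rows["Error"][0] - 1)
```

Rounded to whole units, the adjusted sums of squares agree with the printed column, and the error mean square comes to about 48.1.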


TABLE 5.17 Weights of denervated muscle, adjusted for regression on
weight of normal muscle (unit = 0.01 gram)

B means
b₁         64.4
b₂         62.4
b₃         61.4
b₄         64.7
s.e.      ±1.43

AC two-way table (s.e. ±2.47)

Type of               Number of treatments
current            a₁        a₃        a₆        Means
G                 56.0      59.1      65.2       60.1
F                 60.0      55.8      57.3       57.7
60                63.2      63.9      68.6       65.2
25                64.5      71.8      73.2       69.8

Means             60.9      62.6      66.1       63.2
s.e.                       ±1.24

The means have been adjusted in the usual way for the regression
on the weight of the normal muscle. Because of this adjustment, every
mean has a slightly different standard error. However, as pointed out
in section 3.85, it is sufficiently accurate to use an average standard
error. This is computed from the effective error m.s., which is 48.1 ×
1.019, or 49.0, as described in section 3.85. Thus the standard error for
an entry in the two-way table is √(49.0/8), or 2.47.
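The average standard errors attached to the adjusted means can be retraced as follows. The factor 1.019 is simply taken from the text as given, and the numbers of rats per mean (8 for an AC entry, 24 for a B mean, 32 for an A mean) follow from the 3 × 4 × 4 layout in 2 replications; the function name is illustrative.

```python
import math

error_ms = 48.1      # error m.s. of (y - bx), from table 5.16
inflation = 1.019    # factor quoted in the text (section 3.85)
effective_ms = error_ms * inflation


def se_of_mean(n_rats):
    """Average standard error of an adjusted mean over n_rats animals."""
    return math.sqrt(effective_ms / n_rats)


se_ac_entry = se_of_mean(8)    # entry in the AC two-way table
se_b_mean = se_of_mean(24)     # mean for one length of treatment
se_a_mean = se_of_mean(32)     # mean for one number of daily periods
```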

The 25 cycle alternating current gave a significantly higher mean 
weight than any other type of current. The 60 cycle alternating current 
was superior to both galvanic and faradic currents, the last two not being 
significantly different. The weights increased as the number of treatment
periods daily increased. On inspection the increase appears approximately
linearly related to the number of treatment periods.

To test this supposition, we may fit a regression of the adjusted A
totals on the number of treatment periods (the adjusted A means do not
carry enough decimals to check with the analysis of variance). The
relevant data are shown below.

         Adjusted       z = number
         A totals       of periods
a₁         1950              1
a₃         2005              3          Σ(a − ā)(z − z̄) = 421.667
a₆         2115              6          Σ(z − z̄)² = 12.667

By the usual formula, the contribution of the regression to the sum of
squares for A is (421.667)²/[(12.667)(32)], or 439. The last divisor, 32, is
required because the A totals are totals over 32 rats. The sum of squares
for regression accounts for practically all the sum of squares for A (441),
so that there is no indication of deviation from linearity. The regression
coefficient for a single rat is 421.667/[(12.667)(32)], or 1.0 unit. The
conclusion is that, within the range of periods tested, each additional
treatment period per day increased the weight by 1 unit.
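The regression on the number of treatment periods can be retraced with the sketch below, using the adjusted A totals given above. The variable names are chosen for the illustration.

```python
totals = [1950, 2005, 2115]   # adjusted A totals, each over 32 rats
periods = [1, 3, 6]           # z, number of treatment periods daily
rats_per_total = 32

n = len(totals)
a_bar = sum(totals) / n
z_bar = sum(periods) / n

# Sum of products and sum of squares of deviations.
s_az = sum((a - a_bar) * (z - z_bar) for a, z in zip(totals, periods))
s_zz = sum((z - z_bar) ** 2 for z in periods)

# Contribution of the regression to the s.s. for A, on a single-rat basis.
ss_regression = s_az ** 2 / (s_zz * rats_per_total)

# Increase in weight (units of 0.01 gram) per additional daily period.
slope_per_rat = s_az / (s_zz * rats_per_total)
```

The regression sum of squares comes to about 439 out of the 441 available for A, leaving almost nothing for deviations from linearity.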

The B means in table 5.17 do not indicate any consistent effect of 
length of an individual treatment period (1 to 5 minutes). The major 
part of the AC interaction seems to come from the anomalous behavior 
of the faradic current, which produced a drop in weight from a₁ to a₃
and a₆. Since the interaction as a whole was not significant, it does not
seem worth while to examine the statistical significance of this effect. 

5.33 Other Designs. The total number of treatment combinations in- 
creases rapidly as the number of factors or the number of levels of a fac- 
tor is increased. In this event the latin square necessitates an amount
of replication that is usually impracticable, while it becomes difficult to
assemble homogeneous replications for a randomized blocks arrange- 
ment. Consequently, the experimental error per unit tends to increase. 
In order to avoid this increase in error, a number of designs have been 
produced. The basic principle, which is common to all these designs, is 
the use of a “block” that is smaller than a complete replication. That is, 
the arrangements are such that the differences among these “incom- 
plete” blocks are eliminated from error, just as the differences among 
replicates are eliminated from error in a randomized blocks design. Un- 
fortunately, as will be explained later, this reduction in block size can be 
accomplished only by the deliberate sacrifice of accuracy on certain 
treatment comparisons. The designs fall into three main groups ac- 
cording to the particular treatment comparisons that are sacrificed in 
this way. 

In the first group, the treatment comparisons that are sacrificed are
the high-order interactions. These designs are appropriate in lines of re- 
search where experience has shown that high-order interactions are 
nearly always negligible. The designs available in randomized incom- 
plete blocks are described in chapter 6 and those which can be placed in 
latin squares in chapter 8. 

In the second group, known in agriculture as split-plot experiments, a
factor, or a group of factors and their interactions, are sacrificed. These 
designs, which are very widely used, are discussed in chapter 7. Finally, 
in certain cases it is possible to construct designs so that the loss in ac- 
curacy is spread evenly over all factors and their interactions. Within 
this group there are several types: balanced lattices (chapter 10), bal- 
anced incomplete blocks (chapter 11) and balanced lattice squares 
(chapter 12). In cases where the effects of the factors and the sizes of the 
interactions are rather unpredictable, these designs may be suitable. 

CHAPTER 6


CONFOUNDING 


6.1 The Principle of Confounding

6.11 The 2^3 Factorial Experiment with Complete Confounding. This
chapter deals with designs where the size of block is reduced by the
sacrifice of accuracy on certain high-order interactions. The method
will be illustrated first for a 2^3 experiment with three factors, A, B, and
C, each at two levels. Since there are only 8 treatment combinations,
the replication is not particularly large and in practice this experiment
would most frequently be arranged in ordinary randomized blocks or
perhaps in an 8 X 8 latin square. The example is chosen because of its
simplicity.

The interaction of highest order is the ABC interaction. It will be 
recalled (section 5.23) that this interaction is estimated from the com- 
parison 

(abc) + (a) + (b) + (c) - (ab) - (ac) - (bc) - (1)

Suppose that each replicate in the experiment is divided into 2 blocks of
4 units each, such that one block contains abc, a, b, and c, while the other
contains ab, ac, bc, and (1). With 3 replicates the plan (before ran-
domization) would be as follows.


TABLE 6.1 2^3 experiment in blocks of 4 units, with ABC confounded

            Rep. I         Rep. II        Rep. III
Block       1      2       3      4       5      6

            abc    ab      abc    ab      abc    ab
            a      ac      a      ac      a      ac
            b      bc      b      bc      b      bc
            c      (1)     c      (1)     c      (1)


There are two important properties of this plan. The total of blocks
1, 3, and 5, minus the total of blocks 2, 4, and 6, is the ABC interaction
total. Thus the ABC interaction is one of the components of the
comparisons amongst blocks. It is said to be completely confounded with
blocks. Secondly, the other six factorial effects, A, B, C, AB, AC, and
BC, will all be found to be orthogonal with the block totals. For in-
stance, the AB interaction may be written (apart from the divisor)

[(abc) + (c) - (a) - (b)] + [(ab) + (1) - (ac) - (bc)]

Of the 4 units in any block, two carry a (+) sign in this expression and
two carry a (-) sign. Consequently, if we increase all the observations
in a selected block by any amount, say 50, the estimate of AB remains
unchanged, and similarly for the other 5 factorial effects. This means
that these 6 factorial effects are not influenced by differences amongst
blocks. Various phrases are used to describe this property: the effects
may be said to be unconfounded with blocks, or free from block effects,
or to be composed entirely of within-block comparisons.
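
The two properties just described can be checked numerically. The following is a minimal sketch in Python, not part of the original text: the helper names (`label`, `sign`, `effect_total`) and the yields are hypothetical, chosen only for illustration. It splits the 8 treatment combinations on the sign of ABC, adds 50 to every unit of one block, and compares each factorial effect total before and after.

```python
from itertools import product

FACTORS = "abc"

def label(levels):
    """Name a treatment combination: (1, 0, 1) -> 'ac'; all lower levels -> '(1)'."""
    name = "".join(f for f, lv in zip(FACTORS, levels) if lv)
    return name or "(1)"

def sign(effect, levels):
    """Sign (+1 or -1) of a treatment combination in the contrast for `effect`."""
    s = 1
    for f, lv in zip(FACTORS, levels):
        if f.upper() in effect:
            s *= 1 if lv else -1
    return s

combos = list(product((0, 1), repeat=3))

# The two blocks of one replicate in table 6.1, split on the sign of ABC.
block_plus = [t for t in combos if sign("ABC", t) == +1]
block_minus = [t for t in combos if sign("ABC", t) == -1]

def effect_total(effect, yields):
    """Factorial effect total (apart from the divisor) for one replicate."""
    return sum(sign(effect, t) * y for t, y in yields.items())

# Hypothetical yields, then the same yields with 50 added throughout one block.
yields = {t: 10 + 3 * i for i, t in enumerate(combos)}
shifted = {t: y + 50 if t in block_plus else y for t, y in yields.items()}

for effect in ["A", "B", "C", "AB", "AC", "BC", "ABC"]:
    change = effect_total(effect, shifted) - effect_total(effect, yields)
    print(effect, change)  # zero for every effect except ABC
```

Only the ABC total moves when a block is shifted, which is what "completely confounded with blocks" means in practice.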

Thus differences amongst blocks of 4 units are eliminated from the ex- 
perimental errors of the main effects and two-factor interactions, whereas 
with randomized blocks only differences amongst blocks of 8 units are 
eliminated. The reduction in effective block size is attained by making 
ABC the same as one of the block comparisons. There is no
within-block information available about ABC.

The degrees of freedom in the analysis of variance separate as follows.

                            d.f.

Blocks                        5
A, B, C, AB, AC, BC           6
Error                        12

Total                        23


All sums of squares are calculated in the usual way, so that there is no 
complexity in the computations. 

The composition of the error s.s. is perhaps worth noting. Consider 
the 3 blocks (1, 3, and 5 in table 6.1) that contain the treatments abc, a, 
b, and c. These may be regarded as a randomized blocks experiment 
with 4 treatments and 3 blocks. Consequently the interaction of treat- 
ments with blocks contains six components. Similarly in blocks 2, 4, 
and 6 the interaction of the other set of treatments with blocks provides 
six components. These two sets of six components constitute the error 
for the complete experiment. In other words, the error term in a con- 
founded factorial is made up of interactions between treatments and in- 
complete blocks. This remains true in the more complex designs that 
appear later in this chapter, even though in these cases the composition 
of the error is less easy to detect. 

ABC does not appear explicitly in the analysis above. Actually, the
experiment, if properly randomized, provides an estimate of error and a
test of significance for ABC. In practice it is seldom worth while making
this test, which is usually very insensitive. The test may, however,
throw additional light on the nature of confounding. Let us ignore all
other factorial effects and consider table 6.1 as the plan of an experiment
to determine ABC, i.e., to determine the difference between the group
of treatments abc, a, b, c and the group ab, ac, bc, (1). From this point of
view the block becomes the “experimental unit” and the experiment is
of the ordinary randomized blocks type, having 2 “treatments” and 3
replicates. The total s.s. among the 6 “units” is, of course, the blocks s.s.
in the previous analysis. Thus the blocks s.s. may be subdivided into:

d.f. 


Replicates 2 

ABC 1 

Error for ABC 2 

Total 5 


If confounding has been effective, the error for ABC, being composed of 
comparisons among blocks, will be larger than the error (with 12 d.f.) 
which applies to the rest of the experiment. Moreover, the error for 
ABC is estimated from only 2 d.f., so that the test of ABC is a poor one. 
The test might be worth making with a considerable number of experi-
ments of the same type, where it is desired to examine the average ABC
effect over the whole group.

6.12 The 2^3 Factorial Experiment with Partial Confounding. Any of
the seven factorial effects may be confounded with blocks in this way.
The rest will then be free from block effects. These facts enable us to
spread the confounding in an experiment among several factorial effects.
A plan of this type is shown in table 6.2.

TABLE 6.2 2^3 experiment in blocks of 4 units, with ABC, AC, BC
partially confounded

                 Rep. I      Rep. II     Rep. III
Block            1    2      3    4      5    6

Confounded       ABC         AC          BC

The difference between the totals of blocks 1 and 2 represents the ABC
interaction. In replication II, however, the block composition has been
changed so that the difference between the block totals is the AC inter-
action. In replication III the difference is the BC interaction.

With this plan, A, B, C, and AB are entirely free from block effects.
ABC is completely confounded with blocks in replicate I, but in the other
two replicates the estimate of ABC is orthogonal with blocks. Thus a
“within-block” estimate of ABC may be obtained from replicates II and
III. Similarly, a “within-block” estimate of AC is available from rep-
licates I and III, and one of BC from I and II. These three effects are
partially confounded with blocks, since an estimate that is free from
block effects can still be made for each effect. Each “within-block”
estimate is derived from 2 out of the 3 replicates in the experiment. The
ratio 2/3 serves as a measure of the extent of the confounding. Yates
(6.1) calls this ratio the relative information on the confounded inter-
action. The ratio gives the amount of information available on the
partially confounded effect, relative to that available on an uncon-
founded effect.

In the analysis of variance, all factorial effects appear. 


              d.f.

Blocks          5
A               1
B               1
C               1
AB              1
AC'             1'
BC'             1'
ABC'            1'
Error          11

Total          23


The sums of squares for blocks and for the unconfounded effects A,
B, C, and AB are found in the usual way. The sum of squares for AC is
calculated only from replicates I and III, the divisor for [AC]^2 being 16
instead of 24. Corresponding rules apply to BC and ABC. In the analy-
sis of variance, the primes are inserted as a reminder that these effects
are partially confounded and require special calculations.
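
The bookkeeping of partial confounding can be sketched in a few lines of Python. This is an illustration under stated assumptions, not part of the original text: the helper names are hypothetical, and each replicate is simply split into two blocks on the sign of the effect named in the plan (ABC, AC, BC for replicates I, II, III). For every factorial effect the sketch lists the replicates in which it is clear of blocks, the relative information, and the divisor for the squared effect total (8 units per replicate used).

```python
from fractions import Fraction
from itertools import product

FACTORS = "ABC"
EFFECTS = ["A", "B", "C", "AB", "AC", "BC", "ABC"]
combos = list(product((0, 1), repeat=3))

def sign(effect, levels):
    """Sign (+1 or -1) of a treatment combination in the contrast for `effect`."""
    s = 1
    for f, lv in zip(FACTORS, levels):
        if f in effect:
            s *= 1 if lv else -1
    return s

def blocks_confounding(effect):
    """Two blocks of 4 units obtained by splitting a replicate on `effect`."""
    return [[t for t in combos if sign(effect, t) == s] for s in (+1, -1)]

# Effect confounded in each replicate, as stated for table 6.2.
plan = {"I": "ABC", "II": "AC", "III": "BC"}

def clear_of_blocks(effect, blocks):
    """True if the effect contrast sums to zero within every block."""
    return all(sum(sign(effect, t) for t in block) == 0 for block in blocks)

clear_reps = {
    e: [rep for rep, conf in plan.items()
        if clear_of_blocks(e, blocks_confounding(conf))]
    for e in EFFECTS
}
relative_information = {e: Fraction(len(r), len(plan)) for e, r in clear_reps.items()}
divisor = {e: 8 * len(r) for e, r in clear_reps.items()}

for e in EFFECTS:
    print(e, clear_reps[e], relative_information[e], divisor[e])
```

The divisors 24 and 16 that come out of this sketch are the ones quoted in the text for unconfounded and partially confounded effects respectively.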

6.13 Confounding in the 2^n Series. In order to use confounded de-
signs intelligently, it is not necessary to understand in detail the methods
by which they are constructed. The rules for the construction of con-
founded 2^n factorials will be briefly described, however, since they are
fairly simple and may help to explain the choice of plans presented at
the end of this chapter.

If a replicate is to be divided into 2 blocks, any factorial effect can be
confounded with blocks. In most cases we would confound the highest-
order interaction in all replicates. Suppose that each incomplete block
is to be further divided into 2, so that there will be 4 blocks (I ... IV)
in a replicate. Having chosen one factorial effect to represent the com-
parison (I + II - III - IV), we may easily verify that any other fac-
torial effect may be made to represent the comparison (I - II + III -
IV). That is, to arrange a 2^n factorial in blocks of 2^(n-2) units, we may
confound any two factorial effects that we choose.

The remaining block comparison (I - II - III + IV) represents a
third factorial effect, which is also confounded with blocks. Barnard
(6.2) has shown that this effect is uniquely determined by the two
effects that were chosen. It is always their “generalized interaction,” and
is found by combining all the letters that appear in the two chosen fac-
torial effects, and cancelling all letters that enter twice. Thus, if a 2^5
factorial is to be confounded in blocks of 2^3, or 8, units, we might be in-
clined to choose ABCDE and BCDE as the first two factorial effects to
be confounded. The generalized interaction of these two effects is
ABBCCDDEE or, after cancellation, is A. Consequently, if these two
effects are chosen for confounding, the main effect of A is also automati-
cally confounded. These results may be summarized as follows. “If a
replicate of a 2^n factorial is arranged in blocks of 2^(n-2) units, three fac-
torial effects are confounded with blocks. Of these, two may be chosen
at will: the third is their generalized interaction.”
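
The cancellation rule is easy to mechanize. The sketch below is illustrative only; the function name `generalized_interaction` is hypothetical. It treats each effect as a set of letters and takes the symmetric difference, so that letters entering twice cancel.

```python
def generalized_interaction(*effects):
    """Combine the letters of the given effects, cancelling letters that enter twice.

    Returns the remaining letters in alphabetical order, or 'I' if every
    letter cancels (the identity, i.e. no effect at all).
    """
    letters = set()
    for effect in effects:
        letters ^= set(effect)  # symmetric difference cancels repeated letters
    return "".join(sorted(letters)) or "I"

print(generalized_interaction("ABCDE", "BCDE"))  # the example in the text
print(generalized_interaction("ABC", "ADE"))
```

Applied to ABCDE and BCDE this returns A, the main effect that the text warns would be confounded automatically.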

To proceed, a plan with 8 blocks in the replicate can be obtained from
one with 4 blocks in the replicate by dividing each block into two parts.
In making this division we may confound any factorial effect, except one
of the three that is already confounded. Thus, in the 2^5 example, where
ABCDE, BCDE, and A were confounded in the division into 4 blocks,
we might choose say ACE for the division into 8 blocks. But with 8
blocks in the replicate, there are seven components of the comparisons
among blocks, and each of these must represent one factorial effect.
Hence, in addition to the four factorial effects that are known to be con-
founded, three others are confounded. These three are the generalized
interactions of ACE with ABCDE, BCDE, and A, or BD, ABD, and CE,
respectively.

The general rule should be clear from this case. Suppose that a rep-
licate in a 2^n factorial is to be divided into blocks of 2^(n-k) units, so that
there will be 2^k blocks in the replicate. Then we may select any k fac-
torial effects to be confounded, subject only to the restriction that none
of these must be a generalized interaction of any group of the others. A
further (2^k - k - 1) effects are automatically confounded. These are
all the effects which can be expressed as generalized interactions of the
group of effects selected for confounding.
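
As an illustration of the general rule, the following Python sketch (hypothetical function names, not from the original text) lists every effect that is confounded once k effects have been selected. It forms the generalized interaction of every non-empty group of the selected effects, giving 2^k - 1 effects in all, and rejects a selection in which one effect is a generalized interaction of others.

```python
from itertools import combinations

def generalized_interaction(*effects):
    """Letters left after cancelling those that enter an even number of times."""
    letters = set()
    for effect in effects:
        letters ^= set(effect)
    return "".join(sorted(letters)) or "I"

def confounded_effects(selected):
    """All 2^k - 1 effects confounded when the k `selected` effects are confounded."""
    found = []
    for r in range(1, len(selected) + 1):
        for group in combinations(selected, r):
            found.append(generalized_interaction(*group))
    # A repeated effect, or the identity 'I', means the selection is not independent.
    if "I" in found or len(set(found)) != len(found):
        raise ValueError("one selected effect is a generalized interaction of others")
    return found

# The 2^5 example of the text: ABCDE, BCDE, then ACE for the division into 8 blocks.
effects = confounded_effects(["ABCDE", "BCDE", "ACE"])
print(effects)
```

For the selection ABCDE, BCDE, ACE the list contains the three chosen effects together with A, BD, ABD, and CE, that is 2^3 - 3 - 1 = 4 effects confounded automatically.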

In the construction of designs for practical use, it follows that the ef-
fects to be confounded must not be selected without examination of the
other effects that are automatically confounded. If confounding is to be
restricted to high-order interactions, we cannot choose ABCDE and
BCDE in a 2^5 factorial, since A would then be confounded. In order to
avoid confounding any main effects or first-order interactions in this case,
a little trial will show that the only possible choice is two second-order
interactions and one third-order interaction: for instance ABC, ADE,
and their generalized interaction BCDE. The plans at the end of this
chapter contain what appear to be the best choices for general use.

When the interactions to be confounded have been selected, there re-
mains the problem of writing out the plan showing the treatment com-
binations that appear in each block. This may be done by the procedure
used in the present discussion. The replicate is first divided into two by
confounding one effect, then into four by confounding a second, and so
on. The rules which express each factorial effect in terms of the original
treatment combinations (section 5.24) decide the composition of the
blocks at each stage. For experiments with a large number of factors,
arranged in blocks of small size, Fisher (6.3) has given an alternative
method that is more expeditious [see also Finney (6.4)]. In this method
a group of 2^(n-k) letter combinations is constructed, such that the gen-
eralized interaction of this group with any treatment combination gives
the members of the block that contains this treatment combination.
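
A sketch of how such a group can be used to write out a plan is given below, in Python. It rests on an assumption that is not spelled out in the text: the group is taken to be the set of treatment combinations having an even number of letters in common with every confounded effect. The function names are hypothetical. The example confounds ABC and ADE (and hence BCDE) in a 2^5 factorial, giving 4 blocks of 8 units.

```python
from itertools import combinations

FACTORS = "abcde"

def all_combinations(factors):
    """Every treatment combination, as a frozenset of the letters present."""
    return [frozenset(c) for r in range(len(factors) + 1)
            for c in combinations(factors, r)]

def label(t):
    return "".join(sorted(t)) or "(1)"

def key_block(combos, confounded):
    """Assumed construction: even number of letters in common with each effect."""
    return [t for t in combos
            if all(len(t & set(e.lower())) % 2 == 0 for e in confounded)]

def build_blocks(combos, confounded):
    """Each block is the generalized interaction of the key group with one member."""
    key = key_block(combos, confounded)
    remaining = list(combos)
    blocks = []
    while remaining:
        start = remaining[0]
        block = [start ^ p for p in key]  # symmetric difference cancels letters
        blocks.append(block)
        remaining = [t for t in remaining if t not in block]
    return blocks

def sign(effect, t):
    """Sign of combination t in the contrast for `effect`."""
    missing = len(set(effect.lower()) - t)
    return -1 if missing % 2 else 1

combos = all_combinations(FACTORS)
blocks = build_blocks(combos, ["ABC", "ADE"])
for block in blocks:
    print(" ".join(label(t) for t in block))
```

Within each block the signs of ABC, ADE, and BCDE are constant, while every other effect has as many plus as minus signs, so the remaining effects stay clear of blocks.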

Fisher (6.3) has examined the general problem of confounding when
the object is to keep all main effects and first-order interactions com-
pletely clear of block effects. He has shown that experiments up to the
2^7 factorial (128 treatment combinations) may be placed in blocks of 8
units, while blocks of 16 units can accommodate experiments up to the
2^15 factorial (32,768 treatment combinations).

The preceding discussion applied to a single replication. With more 
than one replicate we may repeat the first replicate, in which case the 
confounding is complete for the interactions involved. Alternatively, 
the effects that are confounded may be changed from one replicate to 
the next, so that some intra-block information is available on all effects. 

6.14 Example of a 2^4 Factorial Confounded in Blocks of 8 Units.
Table 6.3 contains the plan and yields of a 2^4 field experiment on beans
conducted by the Rothamsted Experimental Station in 1936.

TABLE 6.3 Plan and yields (beans in pounds) of a 2^4 factorial
experiment

                  Rep. I                              Rep. II

Block a     p     k     d     npk      Block a     npk   d     p     dnk
            45    55    53    36                   43    42    39    34
Total 375   dnk   dnp   dpk   n        Total 351   n     dnp   k     dpk
            41    48    55    42                   47    52    50    44

Block b     dp    nk    dk    pk       Block b     nk    dp    (1)   np
            50    44    43    51                   43    52    57    39
Total 381   dnpk  (1)   dn    np       Total 395   pk    dk    dnpk  dn
            44    58    41    50                   56    52    54    42

The factors were


Dung (D): none, 10 tons per acre.

Nitrochalk (N): none, 0.4 cwt. N per acre.

Superphosphate (P): none, 0.6 cwt. P2O5 per acre.

Muriate of potash (K): none, 1.0 cwt. K2O per acre.

With 16 treatment combinations in blocks of 8 plots, only one factorial
effect is confounded in each replication. From the plan it will be seen
that DNPK, the highest-order interaction, was confounded in each of
the two replicates. The computations proceed as follows.

Step 1. Calculate the totals for each treatment combination, the
block and replicate totals, and the grand total. The total s.s. and the
sums of squares for replicates and for blocks within replicates should
present no difficulty.

Step 2. This deals with the computation of the factorial effect totals. 
Write down each factorial effect total as a linear function of the treat- 
ment combination totals, as shown in table 6.4. The treatment com- 
binations and their total yields are placed in a systematic order in the 
first two columns, and the remaining columns give the factorial effects. 
Although the writing of the table requires some time, the procedure is 
quite simple. When the columns for D and N have been written down, 
the column for DN is obtained as the product of corresponding signs in 
the D and N columns. Similarly, later in the table DNP may be ob- 
tained as the product of the signs of D and NP, or of N and DP, and so 
on. Thus it is necessary to know only the expressions for the main ef- 
fects. 

The factorial effect totals are shown at the bottom of each column. 





TABLE 6.4 Calculation of factorial effect totals in a 2^4 experiment

The total for DNPK is not required, since owing to the confounding
DNPK will not appear explicitly in the analysis of variance. The DNPK
column, however, allows a check to be made, as follows. If the 15 totals
are added, it may be verified by inspection of table 6.4 that their sum,
+66, should equal 16 times the (dnpk) treatment total, minus the grand
total, i.e.,

16(98) - 1502 = +66
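
Step 2 and the check can be sketched in Python as follows. The treatment totals below are not quoted from the text: they are obtained by adding the two yields of each treatment combination in table 6.3, taking the Rep. I yield of npk as 36, the value implied by the block total 375 and the grand total 1502. The function and variable names are hypothetical.

```python
from itertools import product

# Treatment totals over the two replicates (summed from table 6.3; see lead-in).
totals = {
    "(1)": 115, "d": 95, "n": 89, "dn": 83,
    "p": 84, "dp": 102, "np": 89, "dnp": 100,
    "k": 105, "dk": 95, "nk": 87, "dnk": 75,
    "pk": 107, "dpk": 99, "npk": 79, "dnpk": 98,
}

def sign(effect, combo):
    """Product of main-effect signs: +1 if the factor is present, -1 if absent."""
    letters = set() if combo == "(1)" else set(combo)
    s = 1
    for f in effect.lower():
        s *= 1 if f in letters else -1
    return s

# The 15 factorial effects D, N, DN, P, ..., DNPK.
effects = ["".join(f for f, use in zip("DNPK", mask) if use)
           for mask in product((0, 1), repeat=4) if any(mask)]

effect_totals = {e: sum(sign(e, t) * y for t, y in totals.items()) for e in effects}

grand_total = sum(totals.values())
check = sum(effect_totals.values())
print(effect_totals)
print(check, 16 * totals["dnpk"] - grand_total)
```

The check works because dnpk carries a plus sign in all 15 effects, while every other treatment combination carries a net coefficient of -1 over the 15 columns.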

Yates (6.1) gives a different method for obtaining the factorial effect 
totals. His method is completely automatic and does not require the 
expressions for the factorial effects to be written down. The present 
method, though probably not so speedy as Yates’s method, is more 
flexible in that only those factorial effects that are of interest need be 
obtained. Sometimes the experimenter will wish to isolate only the 
main effects and two-factor interactions, previous experience having 
shown that interactions of higher order are likely to be negligible. 

Step 3. The contribution of each factorial effect to the treatments
s.s. in the analysis of variance is now obtained. Since there are 32 plots,
the square of each effect is divided by 32. The analysis of variance ap-
pears in table 6.5.


TABLE 6.5 Analysis of variance

                     d.f.       s.s.        m.s.

Replications           1         3.1
Blocks in reps.        2       123.2       61.6
D                      1         2.0
N                      1       325.1 ***
P                      1         6.1
K                      1         4.5
DN                     1        32.0
DP                     1       242.0 **
NP                     1        78.1
DK                     1         6.1
NK                     1        32.0
PK                     1        24.5
DNP                    1         2.0
DNK                    1        10.1
DPK                    1        15.1
NPK                    1        32.0
Error                 14       340.0       24.29

Total                 31      1277.9


Note that DNPK is omitted, since it already appears in the blocks s.s.
As a check, the total of the sums of squares for the 14 unconfounded
effects, 811.6, plus that for DNPK, 78.1, should equal the treatments
s.s. as found in the ordinary way from the totals for the 16 treatment
combinations. The error s.s. is found by subtraction.

Only two effects are significant: the main effect of nitrochalk, which
produced a depression of the yield, and the DP interaction. Since
neither D nor P produced significant main effects, it is advisable not to
lay much stress on the DP interaction in the absence of other confirm-
atory evidence.

Step 4. The best presentation of the results will depend on their
nature and on the audience for whom they are intended. Table 6.6 gives
a concise presentation of the main effects and two-factor interactions in a
form that has been used frequently at Rothamsted Experimental Sta-
tion. The response to each factor is shown separately for each level of
every other factor. Thus the row labelled “Dung” contains the mean
response and the differential responses to dung. The figure -2.5 (re-
sponse to dung with nitrochalk absent) is the average response to dung
over all plots that did not receive nitrochalk. The table enables a quick
appraisal to be made of the nature of the main effects and two-factor
interactions. In the present case, of course, the table is scarcely needed
because of the dearth of effects.


TABLE 6.6 Differential responses (pounds per plot)

                                        Response with
                 Mean      Dung (D)     Nitrochalk (N)  Super. (P)    Mur. pot. (K)
Factor           response  Abs.  Pres.  Abs.  Pres.     Abs.  Pres.   Abs.  Pres.

Dung (D)          -0.5                  -2.5  +1.5      -6.0  +5.0    +0.4  -1.4
Nitrochalk (N)    -6.4     -8.4  -4.4                   -9.5  -3.2    -4.4  -8.4
Super. (P)        +0.9     -4.6  +6.4   -2.2  +4.0                    -0.9  +2.6
Mur. pot. (K)     -0.8     +0.1  -1.6   +1.2  -2.8      -2.5  +1.0

s.e.: ±2.46 for differential responses; ±1.74 for mean responses.


The data in the table are easily calculated from the factorial effect
totals in table 6.4. For instance,

Mean response to dung = (1/16)[D] = -8/16 = -0.5

Mean response to dung
(nitrochalk absent) = (1/16){[D] - [DN]} = -40/16 = -2.5

Mean response to dung
(nitrochalk present) = (1/16){[D] + [DN]} = +24/16 = +1.5



These relations may be verified from the signs in the columns in table
6.4.

The standard error per plot is √24.29, or 4.93. Since each differential
response in table 6.6 is the difference between the means of two groups
of 8 plots, its standard error is (4.93)√(2/8) = 4.93/2, or 2.46. The
standard error for a mean response is 1/√2 times this value, or 1.74.
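
The responses and their standard errors follow directly from the effect totals and the error mean square. In the Python sketch below the effect totals are not copied from table 6.4; they are recomputed from the yields of table 6.3 and should be read as an assumption to that extent. The error mean square 24.29 is taken from table 6.5. The function name `response` is hypothetical.

```python
import math

# Main-effect and two-factor totals (recomputed from table 6.3; see lead-in).
effect_totals = {
    "D": -8, "N": -102, "P": 14, "K": -12,
    "DN": 32, "DP": 88, "NP": 50, "DK": -14, "NK": -32, "PK": 28,
}

def response(factor, other=None, present=None):
    """Mean response to `factor`, or its response with `other` absent/present."""
    main = effect_totals[factor]
    if other is None:
        return main / 16
    pair = "".join(sorted(factor + other, key="DNPK".index))
    inter = effect_totals[pair]
    return (main + inter) / 16 if present else (main - inter) / 16

error_ms = 24.29                        # from table 6.5
se_plot = math.sqrt(error_ms)           # standard error per plot
se_diff = se_plot * math.sqrt(2 / 8)    # difference of two means of 8 plots
se_mean = se_diff / math.sqrt(2)        # difference of two means of 16 plots

print(response("D"), response("D", "N", False), response("D", "N", True))
print(round(se_plot, 2), round(se_diff, 2), round(se_mean, 2))
```

A differential response rests on 16 plots and a mean response on all 32, which is why the second standard error is smaller by the factor 1/√2.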


6.15 Confounding of a 3^2 Factorial. Let the symbol ij denote the
treatment combination that has the ith level of A and the jth level of
B (i, j = 0, 1, 2). As before, we wish to keep the main effects clear of
block effects, that is, to confound only the AB interaction. Since there
are 9 treatment combinations in a replicate and since 3 is the only factor
of 9, the size of the incomplete block must be 3 units.

We have seen (section 5.26) that the main effects of A and B both con- 
tain two components, while the AB interaction has four components. 
Further, the main effects and the interaction can be divided into single 
components in various ways, and the division that is most appropriate 
for interpretation of the results will change from experiment to experi- 
ment. This suggests that the particular components of the interaction 
which we should desire to confound will vary from case to case. Actu-
ally, the possibilities for confounding are much more restricted with the
3^n than with the 2^n system: in fact, only one set of components of AB
lends itself readily to confounding.

The confounding is based on the properties of the 3 X 3 graeco-latin
square. Suppose that the 9 treatment combinations are set out as in table
6.7, on which a 3 X 3 square is superimposed. Following the notation
used by Yates (6.1), we denote the latin letters in the square by I1, I2,
and I3, respectively, and the greek letters by J1, J2, and J3, respectively.

TABLE 6.7 Use of a 3 X 3 graeco-latin square to obtain the AB
interaction

          b0             b1             b2

a0     (00) I1J1      (01) I2J2      (02) I3J3
a1     (10) I3J2      (11) I1J3      (12) I2J1
a2     (20) I2J3      (21) I3J1      (22) I1J2


It is clear that comparisons among the row totals of the square give 
the two components of the main effect of A, while comparisons among 
the column totals give the main effect of B. Now consider the I totals: 

I1 = (00) + (11) + (22); I2 = (01) + (12) + (20);

I3 = (02) + (10) + (21)

By the basic property of the latin square, comparisons among these
totals are orthogonal to both rows and columns: that is, to the main ef- 
fects of A and B. Hence, the comparisons among the I totals must 
represent two of the four components of the AB interaction. The same 
argument shows that the two remaining components of AB are derived 
from the comparisons among the J totals, where 

J1 = (00) + (12) + (21); J2 = (01) + (10) + (22);

J3 = (02) + (11) + (20)

A numerical verification of this fact was made at the end of section 5.26,
where the sum of squares for AB was calculated from the latin and
greek letter totals which correspond to the I and J totals.
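
The I and J groupings, and the fact that they account for the whole of the AB interaction, can be illustrated with a short Python sketch. The index rules used here, (j - i) mod 3 for the I groups and (i + j) mod 3 for the J groups, are one way of reproducing the totals listed above and are introduced for illustration; the yields are hypothetical.

```python
from itertools import product

cells = list(product(range(3), repeat=2))    # (i, j): level of A, level of B

# Index 0, 1, 2 corresponds to I1, I2, I3 and to J1, J2, J3.
I_groups = [[(i, j) for i, j in cells if (j - i) % 3 == m] for m in range(3)]
J_groups = [[(i, j) for i, j in cells if (i + j) % 3 == m] for m in range(3)]
A_groups = [[(i, j) for i, j in cells if i == m] for m in range(3)]
B_groups = [[(i, j) for i, j in cells if j == m] for m in range(3)]

# Hypothetical yields for one replicate of the 3 X 3 factorial.
values = [12, 15, 11, 20, 14, 17, 9, 22, 13]
y = dict(zip(cells, values))

grand = sum(y.values())
cf = grand ** 2 / 9

def ss(groups):
    """Sum of squares (2 d.f.) among the totals of three groups of 3 cells."""
    return sum(sum(y[c] for c in g) ** 2 for g in groups) / 3 - cf

total_ss = sum(v * v for v in y.values()) - cf
ss_A, ss_B, ss_I, ss_J = ss(A_groups), ss(B_groups), ss(I_groups), ss(J_groups)

# With a single observation per cell, AB takes whatever A and B leave over.
ss_AB = total_ss - ss_A - ss_B
print(round(ss_I + ss_J, 6), round(ss_AB, 6))
```

Each I group and each J group contains every level of A once and every level of B once, which is why comparisons among their totals are orthogonal to both main effects.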

The application of this result to confounding is shown in table 6.8.
There are two possibilities. In plan (i) the I components of AB are com-
pletely confounded with incomplete blocks, since the block totals have
been made the same as the I totals. The main effects and the J compo-
nents of AB are unconfounded. In plan (ii), the J components of AB are
completely confounded. It should be noted that neither the I nor the
J components are easy to interpret in terms of the results of an actual ex-
periment. They are selected for confounding because they are con-
venient for this purpose.


TABLE 6.8 3 X 3 experiment with AB partially confounded

            Plan (i)                         Plan (ii)
        Incomplete blocks                Incomplete blocks

      (i)     (ii)    (iii)            (i)     (ii)    (iii)

      (00)    (01)    (02)             (00)    (01)    (02)
      (11)    (12)    (10)             (12)    (10)    (11)
      (22)    (20)    (21)             (21)    (22)    (20)

      I1      I2      I3               J1      J2      J3

    I components confounded          J components confounded


If the number of replicates in an experiment is even, we may use plan 
(i) in half the replicates and plan (ii) in the other half. With this ar- 
rangement the I components may be estimated (clear of block effects) 
from those replicates in which plan (ii) is used, and vice versa for the J 
components. The relative accuracy is 1/2 for each set of components.
It may be shown that the relative accuracy is also 1/2 for any component
of AB in which we may be interested. The confounding is said to be
balanced with respect to AB. Balanced confounding is preferable, for
there is no reason to confound I more heavily than J.

This design has limited practical utility. It may be used where the 
factors are known to operate independently, so that AB will be neg- 
ligible. If information about AB is wanted, a randomized blocks design 
is advisable unless there will be a substantial reduction in error variance 
from the use of incomplete blocks. The same principle of construction, 
however, leads to a very useful design for the 3^3 factorial.

6.16 Confounding of a 3^3 Factorial. The same notation will be used.
Thus, (021) denotes the treatment combination a0b2c1. Since there are
27 treatment combinations, the possible sizes of incomplete block are
3 and 9 units. The plan for blocks of 3 units necessitates confounding of
the two-factor interactions and is not given here. With 9 units per
block, only ABC need be confounded. In the case of the 3^2 factorial we
were able to divide the 9 treatment combinations into groups of three
(the I and J groups), such that comparisons among the group totals
gave the components of AB. We shall show that the 27 treatment com-
binations in a 3^3 factorial can be divided into groups of nine such that
comparisons among the group totals give the components of ABC. Each
set of three groups will contribute two components of ABC. Since ABC
has in all eight components or degrees of freedom, there will be four such
sets.

The AB interaction can be calculated separately for each level of C.
As before, we will obtain the AB interaction from its I and J compo-
nents. Table 6.9 shows the I components for each level of C.


TABLE 6.9 The I components of AB shown for each level of C

            c0                       c1                       c2

I1    (000) + (110) + (220)    (001) + (111) + (221)    (002) + (112) + (222)
I2    (010) + (120) + (200)    (011) + (121) + (201)    (012) + (122) + (202)
I3    (020) + (100) + (210)    (021) + (101) + (211)    (022) + (102) + (212)



We may regard table 6.9 as a 3 X 3 table, in which each entry is the
total of 3 treatment combinations. The row totals of the table give the I
components of AB, while the column totals give the main effect of C.
Further, we may take I and J totals from this table just as in table 6.7.
These totals will be called I - I1, I - J1, etc., since they come from the
I components of AB. Thus

I - I1: (000) + (110) + (220) + (011) + (121) + (201) + (022) + (102) + (212)

I - I2: (001) + (111) + (221) + (012) + (122) + (202) + (020) + (100) + (210)

I - I3: (002) + (112) + (222) + (010) + (120) + (200) + (021) + (101) + (211)

By the same argument as for the 3 X 3 design, the comparisons among
these totals of 9 treatment combinations must represent two of the com-
ponents of the interaction of AB with C; that is, of the ABC interaction.

Consequently if we put all treatment combinations in I - I1 into the
first block, those in I - I2 into the second, and those in I - I3 into the
third, we have a plan which completely confounds two of the eight com-
ponents of ABC, and leaves all other factorial effects unconfounded.
This arrangement appears as replication III in plan 6.7 for the 3^3 fac-
torial at the end of this chapter.

The second set of three groups of 9 treatment combinations is obtained
by taking the J totals from table 6.9.

I - J1: (000) + (110) + (220) + (012) + (122) + (202) + (021) + (101) + (211)

I - J2: (001) + (111) + (221) + (010) + (120) + (200) + (022) + (102) + (212)

I - J3: (002) + (112) + (222) + (011) + (121) + (201) + (020) + (100) + (210)

This grouping constitutes replication IV in plan 6.7. The remaining
sets are obtained by forming a 3 X 3 table similar to table 6.9 for the J
components of AB at each level of C. The reader may verify that the
J - I1, J - I2, and J - I3 groups are as shown in replicate I and the
J - J1, J - J2, and J - J3 groups in replicate II of plan 6.7.
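
The four sets of groups can be generated by simple rules modulo 3. The Python sketch below is an illustration, not the construction given in the text: the rules k - (j - i), k + (j - i), k - (i + j), and k + (i + j), each reduced modulo 3, are assumed as a compact restatement of taking I or J totals from the 3 X 3 tables, and the names are hypothetical. The sketch also checks that every block contains each pair of levels of any two factors exactly once, so that main effects and two-factor interactions stay clear of blocks.

```python
from itertools import combinations, product

combos = list(product(range(3), repeat=3))   # (i, j, k): levels of A, B, C

def groups(rule):
    """Three groups of 9 treatment combinations; index 0, 1, 2 = first, second, third."""
    return [[t for t in combos if rule(*t) % 3 == m] for m in range(3)]

sets = {
    "I-I": groups(lambda i, j, k: k - (j - i)),
    "I-J": groups(lambda i, j, k: k + (j - i)),
    "J-I": groups(lambda i, j, k: k - (i + j)),
    "J-J": groups(lambda i, j, k: k + (i + j)),
}

def pairs_balanced(block):
    """True if every pair of levels of any two factors occurs exactly once."""
    for p, q in combinations(range(3), 2):
        seen = sorted((t[p], t[q]) for t in block)
        if seen != sorted(product(range(3), repeat=2)):
            return False
    return True

for name, blocks in sets.items():
    for m, block in enumerate(blocks, start=1):
        labels = " ".join("".join(map(str, t)) for t in block)
        print(f"{name}{m}: {labels}  balanced={pairs_balanced(block)}")
```

Each of the four sets, used as the three blocks of a replicate, confounds two of the eight degrees of freedom of ABC; using all four in turn spreads the confounding over the whole interaction.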

6.17 Example of a 3^3 Factorial Confounded in Blocks of 9 Units.
This experiment, conducted by the Seed Laboratory, Iowa State Col-
lege, tests the effects of 3 levels of nitrogen, 3 of phosphorus, and 3 of
potash on the germination of lettuce seedlings. The seed was thoroughly
mixed, and divided into 108 samples of about 60 seeds each. Each
sample was planted in a copper box, 6 in. square and 1¼ in. deep, in a
mixture of soil and sand. The boxes were placed in a germinator, at a
temperature of about 32°C. At the end of 5 to 7 days the seedlings
were classified as normal, abnormal, hard, or dead. The data in table
6.10 show the numbers of normal lettuce plants.

There were 4 replications, each placed on a different shelf in the ger- 
minator. On a shelf the boxes were placed in 3 columns of 9 boxes each, 
each column being an incomplete block. It will be noted that all 3 fer- 
tilizers had a deleterious effect on emergence. 



TABLE 6.10 Number of lettuce plants emerging in a 3^3 experiment

The use of the + and - signs is discussed later in this section. Grand total 3177



TABLE 6.11 Additional summary totals

From treat. totals 1104 991 1082 3177

From rep. 4 302 290 311 903

The computations proceed as follows.

Step 1. Form the block and replicate totals and the grand total
(shown in table 6.10) and also the totals for each treatment combination
and the three two-way tables (shown in table 6.11).

Step 2. These data enable us to calculate the total s.s. and the sum
of squares for replications, blocks within replications, and for the N,
P, K, NP, NK, and PK factorial effects. All are obtained in the usual
way and are entered in the preliminary analysis of variance (table 6.12).


TABLE 6.12 Analysis of variance for a 3 X 3 X 3 factorial

                                 d.f.        s.s.          m.s.

Replications                       3       2,041.88
Blocks within replications         8       5,008.15       626.02
N                                  2       1,016.67       508.34 **
P                                  2         917.39       458.70 **
K                                  2         293.39       146.70
NP                                 4         399.27        99.82
NK                                 4         589.61       147.40
PK                                 4         212.89        53.22
NPK: confounded in replications
  1                                2'         25.21        12.60
  2                                2'         64.22        32.11
  3                                2'          6.39         3.20
  4                                2'        198.30        99.15
Error                             70       4,146.88        59.24

Total                            107      14,920.25


Subdivision of part of the treatments s.s.

N:   L                             1       1,012.50     1,012.50 **
     Q                             1           4.17         4.17
P:   L                             1         917.35       917.35 **
     Q                             1           0.04         0.04
K:   L                             1         284.01       284.01 *
     Q                             1           9.37         9.37
NP:  L X L                         1         184.08       184.08
     Rest                          3         215.19        71.73
NK:  L X L                         1         256.69       256.69 *
     Rest                          3         332.92       110.97
NPK: L X L X L                     1'         59.12        59.12


Notice that we do not calculate the total s.s. for treatments (with 26
d.f.), since this contains part of the blocks s.s.

Step 3. There remains the calculation of the contribution from the
NPK interactions. Before doing this, it may be remarked that sometimes,
from the nature of the factors or from previous experience, there is
good reason to believe that the three-factor interactions will be
negligible. In this case the experimenter may decide to pool the sum of
squares for NPK with the error s.s. without troubling to compute the
sum of squares for NPK. The pooled error would have 78 d.f. and would
be obtained by subtraction of the sum of squares calculated in step 2
from the total s.s. The danger from this procedure is that if
three-factor interactions are present the experimenter may not detect
them, and moreover his estimate of error will be inflated. For many
types of field experimentation in agriculture the procedure seems
reasonably safe. When there is doubt it is better to isolate NPK.

Consider the two components of NPK that are confounded in replication
1. The contribution of these components to the sum of squares for NPK
must be calculated from the remaining 3 replicates, in which they are
unconfounded with blocks. The totals needed are shown in table 6.11
under the heading "NPK components." From the treatment totals, compute
the total (944) of the 9 treatment combinations that appear in block
1A. Thus

 Block 1A = 012 + 122 + 220 + 202 + 101 + 021 + 000 + 110 + 211
      944 = 119 +  81 +  92 +  96 +  82 +  99 + 171 + 118 +  86

In the same way we obtain 1102 for the total over all treatments that
appear in block 1B and 1131 for block 1C. Underneath these figures
we place the respective totals for blocks 1A, 1B, and 1C. By subtraction
we obtain the totals 773, 748, and 737. These are the totals of the
groups of treatment combinations taken over replications 2, 3, and 4.
The sum of squares of deviations of these quantities from their mean is
divided by 27, since each figure contains 27 observations. The result,
25.21, is the contribution of the two components of NPK to the sum of
squares for NPK. The remaining six components are found similarly from
replications 2, 3, and 4. Of course, all eight components could be
computed in one step as

 (1/27)[(773)^2 + (748)^2 + ... + (701)^2 + (771)^2] - (1/81)[(2258)^2 + ... + (2274)^2]

The error s.s., with 70 d.f., is calculated by subtraction. 
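The arithmetic for the two components confounded in replication 1 can be checked with a few lines of Python. This is only an illustrative sketch: the function name `component_ss` and the variable names are invented here, and the figures are those quoted in the text.

```python
def component_ss(group_totals, n_obs_per_total):
    """Sum of squared deviations of the totals from their mean,
    divided by the number of observations in each total."""
    mean = sum(group_totals) / len(group_totals)
    return sum((t - mean) ** 2 for t in group_totals) / n_obs_per_total

# Totals, over all 4 replicates, of the treatments found in blocks 1A, 1B, 1C.
all_rep_totals = [944, 1102, 1131]
# Totals of the same groups over replications 2, 3, and 4 (quoted in the text).
unconfounded_totals = [773, 748, 737]

# Block totals in replication 1 implied by the subtraction described above.
implied_block_totals = [a - u for a, u in zip(all_rep_totals, unconfounded_totals)]

ss_rep1_components = component_ss(unconfounded_totals, 27)
print(implied_block_totals)           # [171, 354, 394]
print(round(ss_rep1_components, 2))   # 25.21
```

The same function, applied to the corresponding trios for replications 2, 3, and 4, would give the other three entries with 2' d.f. in table 6.12.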

Step 4. In an experiment of this type, where each factor has equally
spaced levels of an ingredient, it is usually advisable to examine the
linear and quadratic components of the response curves. The
contributions to the treatments s.s. are displayed in the lower section
of table 6.12. The method of calculation was described in section 5.26,
where the data for N and P in this experiment were used as an example.
All three fertilizers show significant linear responses, with no
indication of any departure from linearity.

With regard to the two-factor interactions, the "linear by linear"
components have been isolated for NP and NK (see section 5.26 for the
method). For PK it was not thought worth while to do this, since the
total s.s. for PK, 212.89, is not large enough to allow any single
component to be significant. The "linear by linear" component is
significant for NK but not for NP.

Where the "linear by linear" components of the two-factor interactions
are large, it is desirable to isolate the "linear by linear by linear"
component of the three-factor interaction. This will be done as an
exercise. First we must define this component. With two factors, say N
and P, the N_L P_L component has been defined as

 (22) + (00) - (20) - (02)

where the numbers refer to the levels of N and P. The value of this
quantity at the highest level of K, minus its value at the lowest level
of K, measures the linear effect of K on N_L P_L, and is the
N_L P_L K_L component in question. Algebraically, it is

 L = (222) + (200) + (020) + (002) - (022) - (202) - (220) - (000)

The estimate from the treatment totals will be found to be -27.

Since three-factor interactions are partially confounded with blocks,
this component is also partially confounded and must be adjusted so as
to remove block effects. The nature of the confounding is seen by
writing down the expression above in the original table of plot yields
(table 6.10). This has been done by means of the + and - signs that
appear to the left of the seedling numbers in the table. It is evident
that L is orthogonal to the totals of blocks 1C, 2B, 3C, and 4C, since
each of these blocks has one + and one -. In blocks 1B, 2C, 3B, and 4B,
L has three + signs, while in the remaining blocks it has three - signs.
Hence, a quantity that is free from block effects is

 3L - (1B) - (2C) - (3B) - (4B) + (1A) + (2A) + (3A) + (4A)

or

 3(-27) - 354 - 134 - 232 - 290 + 171 + 308 + 197 + 302 = -113

the block symbols denoting block totals. This expression, apart from a
divisor, may be shown to be the least squares estimate of N_L P_L K_L,
adjusted for block differences.

If this quantity is expressed algebraically as a linear function of the
plot yields, the sum of squares of the coefficients is found to be 216.
The contribution of N_L P_L K_L to the treatments s.s. is therefore
(113)^2/216, or 59.12, which is practically the same as the error m.s.
If the number of replicates differs from four, the procedure for
isolating the "linear by linear by linear" component remains the same,
except that the divisor for the square becomes 54r instead of 216.
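The adjustment and the resulting sum of squares can be reproduced as follows. This is a minimal sketch using the block totals quoted above; the names `three_plus`, `three_minus`, and `lll_ss` are hypothetical and chosen only for this illustration.

```python
L = -27  # unadjusted estimate from the treatment totals

# Block totals quoted in the text, grouped by the signs that L carries in them.
three_plus = {"1B": 354, "2C": 134, "3B": 232, "4B": 290}
three_minus = {"1A": 171, "2A": 308, "3A": 197, "4A": 302}

# Quantity free from block effects.
adjusted = 3 * L - sum(three_plus.values()) + sum(three_minus.values())

def lll_ss(adjusted_value, r):
    """Contribution to the treatments s.s.; the divisor is 54r (216 when r = 4)."""
    return adjusted_value ** 2 / (54 * r)

ss = lll_ss(adjusted, r=4)
print(adjusted)        # -113
print(round(ss, 2))    # 59.12
```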

Step 5. This concerns the presentation of the results. Usually it
will be sufficient to show the three two-way tables of means, which are
derivable from the two-way tables of totals (in table 6.11) on division
by 12. Since all main effects and two-factor interactions are
unconfounded, standard errors and t-tests for the 3 X 3 tables are
obtained just as in a randomized blocks experiment. The principal
results are that each fertilizer has produced a significant decrease in
the numbers of seedlings that emerged, the decrease being substantially
proportional to the amount of dressing. The significant N_L K_L
interaction represents the fact that the decrease in emergence from n0
to n2 was smaller at the k2 level than at the k0 level. There is some
indication of a similar effect with N and P, though this did not attain
significance.

The individual 27 treatment totals or means cannot be used as they 
stand for interpretative purposes, since they contain some block effects. 
A table of these totals or means will probably be unnecessary unless some 
aspect of the three-factor interaction requires study. To obtain such a 
table, we adjust each total so as to remove block effects. Each block 
effect is first estimated. For this purpose we do not use simply the ob- 
served block mean, since that in turn contains treatment effects. The 
least squares estimate of any block effect is 

 (1/27)[4(block total) - (total of treatments appearing in the block)]

The basic data needed are available in the section headed "NPK
components" in table 6.11. Thus, for block 1A, the estimated effect is

 (1/27)[4(171) - (944)] = -9.6

The block effects are given below. 


                        Block
 Replication       A        B        C

      1          -9.6    +11.6    +16.5
      2          +4.2     -4.0    -15.1
      3          -8.8     -5.4     -5.4
      4          +3.9     +6.3     +6.0


To adjust any treatment total, we note from the plan the 4 blocks in
which it appears, and compute the sum of the effects for these 4 blocks.
This quantity is subtracted from the unadjusted total to give the
adjusted total. Thus for n1p1k0, which appears in blocks 1A, 2A, 3B, and
4B, the adjusted total is

 118 - [-9.6 + 4.2 - 5.4 + 6.3] = 122.5

To obtain the adjusted mean we divide by 4 as usual.
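The block-effect estimate and the adjustment of a treatment total can be sketched in Python. The function names are invented for this illustration, and the constants 4 and 27 are simply those of the formula in the text for this particular design, not a general rule.

```python
def block_effect(block_total, total_of_treatments_in_block):
    """Least squares estimate of a block effect, as given in the text:
    (1/27) [4 (block total) - (total of treatments appearing in the block)]."""
    return (4 * block_total - total_of_treatments_in_block) / 27

def adjusted_total(unadjusted_total, effects_of_its_blocks):
    """Subtract the sum of the effects of the 4 blocks in which the treatment appears."""
    return unadjusted_total - sum(effects_of_its_blocks)

effect_1a = block_effect(171, 944)
# n1p1k0 appears in blocks 1A, 2A, 3B, and 4B (rounded effects from the text).
adj_110 = adjusted_total(118, [-9.6, 4.2, -5.4, 6.3])

print(round(effect_1a, 1))       # -9.6
print(round(adj_110, 1))         # 122.5
print(round(adj_110 / 4, 1))     # adjusted mean per plot
```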

6.18 Mixed Series: the 3X3X2 Factorial. In the designs discussed 
in previous sections, all factors have the same number of levels. When 
the number of levels differs from one factor to another, the utility of 
confounding is limited. Usually some two-factor interactions must be 
confounded, and the computations become more laborious. Only five 
designs of this type are given at the end of this chapter, though a number 
of others are available in the literature. As an example, we will consider 
the confounding of a 3 X 3 X 2 factorial, which has 18 treatment com- 
binations. 

The main effects of a factor will be kept clear of block effects if every 
block contains each level of the factor the same number of times. Thus
the factor A, which occurs at 3 levels (0, 1, 2), is unconfounded if every
block contains an equal number of 0's, 1's, and 2's. Consequently, block
size has to be a multiple of 3. Similarly we can keep the main effects of 
B (0, 1, 2) unconfounded if the block size is a multiple of 3. For the 
factor C at 2 levels (0, 1), the block size must be a multiple of 2. Hence, 
in order to keep all main effects clear of blocks, the only feasible block 
size is 6. 
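The divisibility argument can be written out as a short search. This is only a sketch of the necessary condition stated above (the block size must be a multiple of every factor's number of levels); the function name is hypothetical, and the extra requirement that the block size divide the replicate size is an assumption added here so that a replicate splits into equal blocks.

```python
from math import prod

def feasible_block_sizes(levels):
    """Block sizes smaller than a full replicate that are a multiple of each
    factor's number of levels and that divide the replicate evenly."""
    replicate_size = prod(levels)
    return [
        k
        for k in range(2, replicate_size)
        if replicate_size % k == 0 and all(k % n == 0 for n in levels)
    ]

print(feasible_block_sizes((3, 3, 2)))   # [6]
```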

Further, with 6 units in a block, every possible combination of the 
levels of A and those of C can appear once in every block, so that we may 
expect to be able to keep AC, and likewise BC, unconfounded. We can- 
not place all the 9 combinations for AB in a block, so that AB will be 
partially confounded. 

From this approach the plan can be constructed rather easily. Every 
block is to contain the 6 combinations (00), (10), (20), (01), (11), (21) 
of B and C. The only question is the manner in which the 3 levels of A 
are combined with the 6 pairs above. This allocation is to be such that 
all 6 combinations of AC appear in the block. That is, the 0, 1, and 2 
levels of A must each appear with the 0 level of C and each with the 1 
level of C. We may impose one additional rule, designed to confound 
AB as little as possible. Although we cannot represent all 9
combinations of AB in a block, it will be well to represent as many as possible.
Thus we introduce the restriction that any AB combination (e.g., 12) 
must not appear more than once in a block. 

Under these rules only four types of replicate can be made up, as shown
in table 6.13.

TABLE 6.13 Possible blocks for a 3 X 3 X 2 factorial

                               Level of A
 B  C     Ia  Ib  Ic    IIa IIb IIc    IIIa IIIb IIIc    IVa IVb IVc

 0  0      1   2   0      2   0   1       1    2    0      2   0   1
 1  0      2   0   1      0   1   2       0    1    2      1   2   0
 2  0      0   1   2      1   2   0       2    0    1      0   1   2
 0  1      2   0   1      1   2   0       2    0    1      1   2   0
 1  1      0   1   2      2   0   1       1    2    0      0   1   2
 2  1      1   2   0      0   1   2       0    1    2      2   0   1

Consider, for instance, in how many ways the block containing (000)
can be constructed. The first three A levels (reading from the top in
table 6.13) must be either 0, 1, 2 or 0, 2, 1, so that each A level
shall appear with the 0 level of C. If 0, 1, 2 is chosen, the remaining
three A levels must be either 1, 2, 0 or 2, 0, 1, these being the only
sequences that make no AB combination appear twice. These sequences give
blocks Ic and IIb respectively. Similarly, the choice of 0, 2, 1 for the
first three A's leads only to blocks IIIc and IVb.

In the same way, we find that there are only four possible blocks
containing the treatment combination (001) and four containing (002).
The three sets of 4 blocks can be grouped into 4 separate replications
(I, II, III, and IV in table 6.13).
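The counting argument can be confirmed by brute-force enumeration. The sketch below applies the two rules stated above (every AC combination appears once; no AB combination appears twice) and represents a block by the six A levels read down the rows of table 6.13. The function name and the tuple representation are assumptions made for this example.

```python
from itertools import permutations

def possible_blocks():
    """Each block is a tuple of six A levels, for (B, C) = (0,0), (1,0), (2,0),
    (0,1), (1,1), (2,1) in that order."""
    blocks = []
    for with_c0 in permutations(range(3)):       # each A level once with C = 0
        for with_c1 in permutations(range(3)):   # each A level once with C = 1
            # No AB combination may appear twice: A must differ at each B level.
            if all(x != y for x, y in zip(with_c0, with_c1)):
                blocks.append(with_c0 + with_c1)
    return blocks

blocks = possible_blocks()
containing_000 = [b for b in blocks if b[0] == 0]

print(len(blocks))          # 12
for b in containing_000:    # the four blocks that contain A = 0 with B = 0, C = 0
    print(b)
```

The twelve blocks found in this way correspond to the twelve columns of table 6.13, four for each A level paired with B = 0, C = 0.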

As Yates (6.1) has shown, the plan can be written in a more con- 
densed form (table 6.14) which exhibits the nature of the confounding. 


TABLE 6.14 Condensed form of the plan for a 3 X 3 X 2 confounded
factorial

 Level                               Blocks
 of C     Ia  Ib  Ic    IIa IIb IIc    IIIa IIIb IIIc    IVa IVb IVc

   0      I3  I2  I1     I2  I1  I3      J2   J3   J1     J3  J1  J2
   1      I2  I1  I3     I3  I2  I1      J3   J1   J2     J2  J3  J1

For example, in block Ia the 3 combinations that appear with the
zero level of C are (01), (12), and (20). This is the group that forms
the I3 component of AB (see section 6.15). It follows that in the first
2 replicates the I component of AB is partially confounded; the J
component can be verified to be unconfounded. In replicates III and IV
the J component is partially confounded, while the I component is clear
of blocks. ABC is partially confounded in all replicates. The relative
information on AB has been shown to be 7/8 and that on ABC, 5/8.

6.19 Numerical Example of the 3 X 3 X 2 Design. Table 6.15 shows
the plan and analysis of variance for an experiment on the response of
young tung trees to fertilizers. The results of this experiment have been
reported by Merrill, Kilby, and Greer (6.17). The general object was to
discover the type of fertilizer application that would stimulate the
early growth so that the young trees would be ready for transplanting to
commercial orchards by early autumn. The data shown are the mean heights
in centimeters of the 12 tung trees on each plot. For convenience in
following the computations, the plot yields follow the same order as in
table 6.13. Note that factor A appears last in table 6.13 and first in
table 6.15.


TABLE 6.15 Data for 3 X 3 X 2 factorial experiment in blocks of 6 units

             Ia         Ib         Ic     Rep. total     IIa        IIb        IIc

          100:  80   200:  78   000:  38              200: 136   000:  43   100:  89
          210:  86   010:  55   110:  73              010:  56   110:  81   210:  87
          020:  70   120:  82   220:  75              120:  64   220:  90   020:  66
          201:  74   001:  67   101:  78              101:  95   201:  81   001:  91
          011:  82   111:  67   211:  51              211:  76   011:  61   111:  97
          121:  86   221:  57   021:  66              021:  71   121:  65   221:  60

 Totals      478        406        381      1,265        498        421        490

            IIIa       IIIb       IIIc    Rep. total     IVa        IVb        IVc

          100:  86   200:  73   000:  66              200:  88   000:  53   100:  81
          010:  79   110:  76   210:  85              110: 107   210:  66   010:  58
          220:  73   020:  97   120: 101              020:  70   120:  92   220:  56
          201:  97   001: 116   101: 117              101:  79   201:  88   001:  90
          111:  79   211:  86   011: 106              011:  92   111: 109   211:  68
          021: 113   121:  81   221: 102              221:  96   021:  95   121:  67

 Totals      527        529        577      1,633        532        503        420


 Analysis of variance

 Source                                              d.f.     s.s.     m.s.

 Replications                                          3     3,837    1,279
 Blocks within replications                            8     2,836      354
 Levels of 8-8-6 in row (A)                            2     1,116      558
 Meals (B)                                             2       254      127
 Levels of 8-8-6 side-dressed (C)                      1       868      868 *
 8-8-6 in row X meals (AB)                             4'    1,129      282
 8-8-6 in row X 8-8-6 side-dressed (AC)                2     2,995    1,498 **
 Meals X 8-8-6 side-dressed (BC)                       2       424      212
 8-8-6 in row X meals X 8-8-6 side-dressed (ABC)       4'    1,016      254
 Error                                                43     8,909      207

 Total                                                71    23,384

The three factors were as follows, all dressings being per acre.

 A: 8-8-6 fertilizer applied in the row
      0  None
      1  200 lb.
      2  400 lb.

 B: Meals
      0  None
      1  Tung meal (650 lb.)
      2  Cottonseed meal (650 lb.)

 C: 8-8-6 fertilizer applied as side-dressing
      0  None
      1  200 lb.

The side-dressing was placed in a shallow furrow on one side of the
row about 8 inches from the tung seedlings. Factors A and B were
applied at planting (March 13), factor C on June 3, and the height
measurements were made on October 23. It should be noted that all three
factors supply each of the common plant nutrients (N, P, and K).

In an experiment of this type, one might anticipate that two-factor
interactions will be present. If, for example, 400 lb. of fertilizer
applied in the row supply all the nutrients that the young tree can
utilize, plots that receive this dressing will show no response to
either factor B or C. On the other hand, plots that receive the zero
level of A may respond to the dressings supplied in factors B and C. It
would not be wise to confound any two-factor interactions heavily.
Since, however, the soils available for experiments had been found to be
very variable, it was decided to arrange the experiment in blocks of 6
plots. As we have seen, this will partially confound AB, though as the
relative information is high (7/8), not much information is lost on this
comparison even if the incomplete blocks prove ineffective.

Table 6.16 shows the computation sheet leading to the analysis of
variance.


TABLE 6.16 Computation sheet for 3 X 3 X 2 experiment, blocks of
6 units

(i) Treatment totals

                 c0                       c1
          b0     b1     b2         b0     b1     b2
 a0      200    248    303        364    341    345
 a1      336    337    339        369    352    299
 a2      375    324    294        340    281    315

(ii) Two-way tables

                      c1 + c0                               c1 - c0
           b0       b1       b2     Totals        b0       b1       b2     Totals
 a0      564(1)   589(4)   648(7)    1,801     +164(1)   +93(4)   +42(7)    +299
 a1      705(2)   689(5)   638(8)    2,032      +33(2)   +15(5)   -40(8)      +8
 a2      715(3)   605(6)   609(9)    1,929      -35(3)   -43(6)   +21(9)     -57
 Totals  1,984    1,883    1,895     5,762      +162      +65      +23      +250

(iii) From (c1 + c0)

 I1 = (1) + (5) + (9) = 1,862          J1 = (1) + (6) + (8) = 1,807
 I2 = (2) + (6) + (7) = 1,958          J2 = (2) + (4) + (9) = 1,903
 I3 = (3) + (4) + (8) = 1,942          J3 = (3) + (5) + (7) = 2,052
                  Total 5,762                           Total 5,762

 2I1' = 2I1 + Ia + IIa = 4,700         2J1' = 2J1 + IIIa + IVa = 4,673
 2I2' = 2I2 + Ib + IIb = 4,743         2J2' = 2J2 + IIIb + IVb = 4,838
 2I3' = 2I3 + Ic + IIc = 4,755         2J3' = 2J3 + IIIc + IVc = 5,101
                  Total 14,198                           Total 14,612

 s.s. for AB = [(4,700)^2 + (4,743)^2 + ... + (5,101)^2]/84
               - [(14,198)^2 + (14,612)^2]/252 = 1,129

(iv) From (c1 - c0)

 R1 = +200, R2 = +32, R3 = +18;   S1 = +81, S2 = +147, S3 = +22
 where, e.g., R1 = (1) + (5) + (9), ..., S3 = (3) + (5) + (7)

 2R1' = 2R1 - Ib + Ic + IIb - IIc = +306
 2R2' = 2R2 + Ia - Ic - IIa + IIc = +153
 2R3' = 2R3 - Ia + Ib + IIa - IIb =  +41
                             Total  +500

 2S1' = 2S1 - IIIb + IIIc + IVb - IVc = +293
 2S2' = 2S2 + IIIa - IIIc - IVa + IVc = +132
 2S3' = 2S3 - IIIa + IIIb + IVa - IVb =  +75
                                 Total  +500

 s.s. for ABC = [(+306)^2 + (+153)^2 + ... + (+75)^2]/60 - (+500)^2/90 = 1,016


The steps are as follows.

i. Calculate the block totals (denoted by Ia, Ib, etc., and shown in
table 6.15), and the treatment totals (table 6.16). Calculate the total
s.s. and the blocks s.s. The latter has been divided into two
components: replications (3 d.f.) and blocks within replications
(8 d.f.). This separation enables us to estimate the decrease in error
variance due to the reduction in block size from 18 to 6 plots. The
analysis of variance appears at the foot of table 6.15.

ii. Form the two two-way tables as shown. From the margins of the table
for (c1 + c0), calculate the sum of squares for the main effects of A
and B. From the total of the table for (c1 - c0), we calculate the sum
of squares for the main effect of C: this is (250)^2/72, since there are
72 plots. Similarly, the margins of the table provide the sum of squares
for the AC and BC interactions, which being unconfounded are obtained
in the usual way.

iii. These steps lead to the sum of squares for AB. First calculate the
I and J components of AB, which are obtained by summation from the
(c1 + c0) table in step ii, as shown in table 6.16 (iii). Then apply the
adjustments for block effects. Since I1, for instance, does not appear
in blocks Ia or IIa, the adjustments involve the totals for these two
blocks. The adjustments lead to the quantities 2I1', etc., from which
the sum of squares for AB follows as shown.

iv. These steps lead to the sum of squares for ABC. From the table
for (c1 - c0), step ii, form totals R1 and S1 which are calculated by
the same rules as for the I's and J's, respectively. These totals are
then adjusted for block effects, yielding quantities 2R1', 2S1', etc.,
from which the ABC s.s. is easily obtained. The error s.s. is then
computed by subtraction.
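Steps ii to iv can be traced numerically from the treatment totals and block totals. The following Python sketch is one possible arrangement of the arithmetic, not the authors' own computing routine; the names `totals`, `blk`, `cell`, and `group_sums` are invented here, and the figures are those of tables 6.15 and 6.16.

```python
# Treatment totals from table 6.16 (i), indexed as totals[c][a][b].
totals = {
    0: [[200, 248, 303], [336, 337, 339], [375, 324, 294]],
    1: [[364, 341, 345], [369, 352, 299], [340, 281, 315]],
}
# Block totals from table 6.15.
blk = {"Ia": 478, "Ib": 406, "Ic": 381, "IIa": 498, "IIb": 421, "IIc": 490,
       "IIIa": 527, "IIIb": 529, "IIIc": 577, "IVa": 532, "IVb": 503, "IVc": 420}

def cell(table, k):
    """Cell (k) of an a-by-b table, numbered (1) = a0b0, (2) = a1b0, ..., (9) = a2b2."""
    return table[(k - 1) % 3][(k - 1) // 3]

def group_sums(table, groups):
    return [sum(cell(table, k) for k in g) for g in groups]

plus = [[totals[1][a][b] + totals[0][a][b] for b in range(3)] for a in range(3)]
minus = [[totals[1][a][b] - totals[0][a][b] for b in range(3)] for a in range(3)]

I_GROUPS = [(1, 5, 9), (2, 6, 7), (3, 4, 8)]
J_GROUPS = [(1, 6, 8), (2, 4, 9), (3, 5, 7)]
I, J = group_sums(plus, I_GROUPS), group_sums(plus, J_GROUPS)
R, S = group_sums(minus, I_GROUPS), group_sums(minus, J_GROUPS)

# Step iii: adjust the I's and J's for block effects.
I_adj = [2 * I[0] + blk["Ia"] + blk["IIa"],
         2 * I[1] + blk["Ib"] + blk["IIb"],
         2 * I[2] + blk["Ic"] + blk["IIc"]]
J_adj = [2 * J[0] + blk["IIIa"] + blk["IVa"],
         2 * J[1] + blk["IIIb"] + blk["IVb"],
         2 * J[2] + blk["IIIc"] + blk["IVc"]]
ss_ab = (sum(x * x for x in I_adj + J_adj) / 84
         - (sum(I_adj) ** 2 + sum(J_adj) ** 2) / 252)

# Step iv: adjust the R's and S's for block effects.
R_adj = [2 * R[0] - blk["Ib"] + blk["Ic"] + blk["IIb"] - blk["IIc"],
         2 * R[1] + blk["Ia"] - blk["Ic"] - blk["IIa"] + blk["IIc"],
         2 * R[2] - blk["Ia"] + blk["Ib"] + blk["IIa"] - blk["IIb"]]
S_adj = [2 * S[0] - blk["IIIb"] + blk["IIIc"] + blk["IVb"] - blk["IVc"],
         2 * S[1] + blk["IIIa"] - blk["IIIc"] - blk["IVa"] + blk["IVc"],
         2 * S[2] - blk["IIIa"] + blk["IIIb"] + blk["IVa"] - blk["IVb"]]
ss_abc = (sum(x * x for x in R_adj + S_adj) / 60
          - (sum(R_adj) ** 2 + sum(S_adj) ** 2) / 180)

# Step ii: main effect of C from the grand total of the (c1 - c0) table.
ss_c = sum(map(sum, minus)) ** 2 / 72

print(I_adj, J_adj)
print(R_adj, S_adj)
print(round(ss_ab), round(ss_abc), round(ss_c))
```

Because the two trios of adjusted R's and S's each total +500, the correction term with divisor 180 in the sketch is the same as (500)^2/90.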

v. Summary and presentation of results. Consider first the
conclusions indicated by the analysis of variance. The only significant mean
square for a main effect is that for C (side-dressing). The AC interaction
is significant at the 1% level, while the mean square for A, though not 
significant, is substantially above the error m.s. Neither B (meals) nor 
any of its interactions approaches significance. These results suggest 
that the AC two-way table should be examined. 

Since table 6.17 contains no confounded effects, the means per plot
are found simply by addition and division from table 6.16, (i).


TABLE 6.17 AC two-way table

 Mean heights (centimeters) ± 4.15

                            Side dressing
 Row dressing          c0 None     c1 200 lb.

 a0  None                62.6         87.5
 a1  200 lb.             84.3         85.0
 a2  400 lb.             82.8         78.0


It is evident that there are large and significant responses to the
200 lb. dressings, both when applied in the row and as side-dressing.
The means, 84.3 for the row application and 87.5 for the side-dressing,
do not differ significantly, so that the applications appear to have
been about equally effective. There is no further increase to the
400 lb. row dressing, or to mixtures containing both types of dressing.
Thus the large AC interaction arises primarily because each dressing is
effective only in the absence of the other.
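The entries of the AC table and their standard error follow directly from the treatment totals. The lines below are a small sketch of that arithmetic, with variable names chosen for this example only; each mean is taken over 12 plots (3 meals by 4 replicates).

```python
import math

# Treatment totals from table 6.16 (i), indexed as totals[c][a][b].
totals = {
    0: [[200, 248, 303], [336, 337, 339], [375, 324, 294]],
    1: [[364, 341, 345], [369, 352, 299], [340, 281, 315]],
}
error_ms = 207        # error mean square from the analysis of variance
plots_per_mean = 12   # 3 levels of B x 4 replicates

# ac_means[a][c]: mean height for row dressing a and side-dressing c.
ac_means = [[sum(totals[c][a]) / plots_per_mean for c in (0, 1)] for a in range(3)]
standard_error = math.sqrt(error_ms / plots_per_mean)

for a, row in enumerate(ac_means):
    print(f"a{a}: " + "  ".join(f"{m:5.1f}" for m in row))
print(f"s.e. = {standard_error:.2f}")
```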

The analysis of variance would indicate that no definite effects of the 
meals (B) can be established. As an exercise, the AB two-way table, 
which involves partially confounded effects, will be examined. The 
means must be adjusted for block effects. This could be done by first 
calculating a mean for each incomplete block, adjusted for treatment 
effects, and then adjusting each AB mean for the 4 blocks in which it 
lies. It is slightly quicker to perform the adjustments by means of the
I and J effects, as follows.

In table 6.16, (iii), divide the quantities 2I1', etc., by 42, and
calculate the deviations from their mean. The quantities 2J1', etc., are
treated similarly. The same procedure is carried out for the I's and
J's, except that the divisor is 24. This gives

 i1' = -0.8,  i2' =  0.2,  i3' =  0.5      j1' = -4.7,  j2' = -0.8,  j3' =  5.4
 i1  = -2.4,  i2  =  1.6,  i3  =  0.9      j1  = -4.7,  j2  = -0.7,  j3  =  5.5
 δi1 = +1.6,  δi2 = -1.4,  δi3 = -0.4      δj1 =  0.0,  δj2 = -0.1,  δj3 = -0.1

where each δ is the difference between the adjusted and unadjusted
deviation, e.g., δi1 = i1' - i1.

The unadjusted mean of (a0b0) is 70.5. From table 6.16, (ii) and (iii),
we see that (a0b0) belongs to I1 and J1. Hence its adjusted value is

 (a0b0)' = (a0b0) + δi1 + δj1 = 70.5 + 1.6 + 0.0 = 72.1

By the same principle, the adjustment to (a2b1), for instance, is
(δi2 + δj1). The complete two-way table is shown below.

TABLE 6.18 AB two-way table, adjusted for block effects

 Mean heights (centimeters)

 Row dressing        b0 None     b1 Tung     b2 Cottonseed

 a0  None              72.1        73.1          79.5
 a1  200 lb.           86.6        87.6          79.4
 a2  400 lb.           88.9        74.2          77.6


The standard error of a single figure in the table is sqrt(207/8), or
5.09, since each figure is the mean of 8 plots. This standard error
applies to t-tests of unconfounded effects; i.e., of the main effects of
A and B. For t-tests of components of the AB interaction, the figure
must be multiplied by sqrt(8/7), or 1.069, the value 8/7 being the
inverse of the relative efficiency on AB.

For t-tests of quantities that are a mixture of main effects and
interactions, the multiplier lies between 1.069 and 1. It will usually
be sufficiently accurate to use the multiplier 1.069. If desired, the
correct standard error may be obtained by means of a rule which will be
given without proof. The quantity whose standard error is wanted is
expressed in terms of components of main effects and interactions.
Suppose that we wish to test the response to cottonseed when no row
fertilizer is applied. The response is (79.5 - 72.1), or 7.4 cm. The
approximate standard error is 5.09 X 1.069 X sqrt(2), or 7.69. To obtain
the correct value, we write symbolically,

 (b2 - b0)a0 = (b2 - b0)(a0 + a1 + a2)/3 + (b2 - b0)(2a0 - a1 - a2)/3

If all means are unadjusted means, this is an algebraic identity. The
first term on the right is a component of the main effect of B, while
the second is a component of the AB interaction. Further, by a property
of least squares, the equation remains valid if means adjusted for block
effects are substituted for unadjusted means. By the rule for the
variance of a linear function, the variance of the first term on the
right is 6σ^2/9, where σ^2 is the variance of a single figure. By the
same rule, the variance of the second term is

 (12σ^2/9)(8/7)

the factor 8/7 being introduced because the second term is a component
of AB. The two terms are orthogonal, so that the estimated standard
error of (b2 - b0)a0 is

 5.09 sqrt[6/9 + (12/9)(8/7)] = 5.09 sqrt(2.190) = 7.53
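The approximate and the exact standard errors can be compared numerically. This is a brief sketch of the rule as applied to this one comparison; the variable names are hypothetical, and the coefficients 6/9 and 12/9 are the sums of squared coefficients of the two terms of the identity.

```python
import math

error_ms = 207
se_single = math.sqrt(error_ms / 8)      # each figure is the mean of 8 plots
inverse_info_ab = 8 / 7                  # inverse of the relative information on AB

# Approximate rule: treat the whole difference as an AB component.
se_approx = se_single * math.sqrt(inverse_info_ab) * math.sqrt(2)

# Exact rule: main-effect term (variance 6/9) plus AB term (variance 12/9 times 8/7).
variance_factor = 6 / 9 + (12 / 9) * inverse_info_ab
se_exact = se_single * math.sqrt(variance_factor)

print(round(se_single, 2))        # 5.09
print(round(se_approx, 2))        # 7.69
print(round(variance_factor, 3))  # 2.19
print(round(se_exact, 2))         # 7.53
```

As the text remarks, the approximate multiplier overstates the standard error only slightly, which is why it is usually adequate.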

Adjustment of the 18 individual treatment means is more tedious. In
table 6.16, (iv), we divide the quantities 2R1', etc., by 30, and the
quantities R1, etc., by 24, thereafter taking deviations from the mean
of each trio. This gives

 r1' = +4.6,  r2' = -0.5,  r3' = -4.2      s1' = +4.2,  s2' = -1.2,  s3' = -3.1
 r1  = +4.8,  r2  = -2.2,  r3  = -2.7      s1  = -0.1,  s2  = +2.6,  s3  = -2.6
 δr1 = -0.2,  δr2 = +1.7,  δr3 = -1.5      δs1 = +4.3,  δs2 = -3.8,  δs3 = -0.5

In this experiment there is little point in adjusting all 18 treatment
means. However, the unadjusted treatment totals, table 6.16, (i),
suggest that there might be an effect of the meals (factor B) when
applied alone. Hence it may be of interest to test the difference
between the adjusted means of a0b2c0 and a0b0c0.

From table 6.16, (iii) and (iv), we see that a0b0c0 belongs to i1, j1,
r1, and s1. Since the unadjusted mean is 50.0, the adjusted mean is

 50.0 + δi1 + δj1 - δr1 - δs1 = 50.0 + 1.6 + 0.0 + 0.2 - 4.3 = 47.5 cm.

Note that the terms in δr and δs carry a minus sign whenever the
treatment is at the c0 level, and a plus sign when the treatment is at
the c1 level. For a0b2c0, which belongs to i2, j3, r2, and s3, the
adjusted mean will be found to be 73.1 cm.

In order to obtain the standard error of the difference (73.1 - 47.5)
= 25.6, this must be expressed in terms of components of main effects
and interactions. This requires some practice, but the method is not
hard to master. Consider the following table of values of (b2 - b0),
averaged over the 4 replicates.

                  (b2 - b0)
           a0     a1     a2     Total

 c0         d      e      f       l
 c1         g      h      k       m

 Total      n      o      p       q

The quantity whose standard error we wish is d. Now it is easy to
verify that

 6d = q + (l - m) + (2n - o - p) + [(2d - e - f) - (2g - h - k)]

the four terms on the right being components of B, BC, AB, and ABC,
respectively. As before, this is an identity for the unadjusted means
and remains valid for adjusted means. Hence by the same rule as before

 36 σ_d^2 = (σ^2/4)[12 + 12 + 24(8/7) + 24(8/5)] = 22.48σ^2

where σ^2 is the estimated variance per plot. The extra factor 8/7 is
used for the AB component and the factor 8/5 for the ABC component, for
which the relative information is 5/8. Consequently, the estimated
standard error of d is

 sqrt(22.48 X 207/36) = 11.4 cm.



It follows that the t-value for the response to cottonseed when applied
alone is 25.6/11.4, or 2.25, with 43 d.f., which is significant at the 5%
level.
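The standard error of d and the t-value can be recomputed from the four components. This is a sketch under the assumptions stated in the comments, with hypothetical variable names. Evaluated without intermediate rounding, the multiplier comes to about 22.46 rather than the printed 22.48; the rounded standard error and t-value are unaffected.

```python
import math

error_ms = 207                     # estimated variance per plot, 43 d.f.
info_ab, info_abc = 7 / 8, 5 / 8   # relative information on AB and ABC

# Each entry of the (b2 - b0) table is a difference of two means of 4 plots,
# so its variance is 2 sigma^2 / 4. In units of sigma^2 / 4 the four terms of
# the identity for 6d contribute 12 (B), 12 (BC), 24 (AB), and 24 (ABC); the
# confounded terms are divided by their relative information.
units = 12 + 12 + 24 / info_ab + 24 / info_abc
multiplier = units / 4             # variance of 6d as a multiple of sigma^2

se_d = math.sqrt(multiplier * error_ms / 36)
t_value = (73.1 - 47.5) / se_d

print(round(multiplier, 2))
print(round(se_d, 1))      # 11.4
print(round(t_value, 2))
```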

These results illustrate a feature that is common to the mixed series of 
confounded factorials. If the examination of the results is directed 
mostly at simple rather than factorial effects, the analysis is likely to be 
time-consuming because of the adjustments needed to remove block ef- 
fects and to construct standard errors by the method given above. It 
will also be apparent that considerable experience in the analysis of 
variance, or access to expert advice on this topic, is advisable. 

6.2 The Use of Confounded Designs 

6.21 General Recommendations. In order to make efficient use of
confounding, it is necessary to consider whether the advantage is likely
to outweigh the disadvantages.

The advantage comes from the reduction in the experimental error 
by the use of a block which is more homogeneous, or which can be sub- 
jected to a more uniform technique, than the complete replicate. With- 
out some idea of the amount of this reduction, a realistic decision cannot 
be made on the question of confounding. If uniformity data have been
collected, the experimenter may compare the variability within the
incomplete blocks with the variability within replicates. In addition,
from the results of an experiment in which confounding has been em- 
ployed, it is usually possible to estimate what the experimental error 
would have been if confounding had not been used. The method of cal- 
culation, which was first outlined by Yates (6.5, p. 215), is given in sec- 
tion 6.35. In many types of research the reduction in error from con- 
founding varies greatly from experiment to experiment. In such cases 
little reliance can be placed on the results of a single trial; a summary of 
a moderately large group of experiments is necessary. 

The disadvantages of confounding consist of (1) the reduction in 
replication on the confounded treatment comparisons, and (2) in some 
cases, a greater complexity in the calculations. With regard to the first, 
the experimenter should take into account the importance of the inter- 
actions that must be confounded and the extent to which they are con- 
founded. No interaction should be completely confounded unless there 
is good reason to believe, either from previous experience or from the 
nature of the factors to be tested, that the interaction will be negligible. 

As an illustration of the type of reasoning that might be employed in
reaching a decision, consider the 3 X 3 X 2 design discussed in the
previous section. The incomplete blocks contain 6 units, with 7/8
relative information on AB and 5/8 relative information on ABC. ABC is
expected to be negligible; AB, however, is of sufficient interest so
that confounding is undesirable if it involves any loss of precision for
this comparison. The issue resolves itself into the question: is the
error variance per unit with blocks of 6 units likely to be less than
7/8 of the error variance with blocks of 18 units? If so, the AB
interaction is estimated more precisely from the confounded design than
from randomized blocks, because the reduction in variance per unit more
than compensates for the loss of replication due to confounding. Since
the relative information is useful for this type of argument, it is
given in table 6.19 for all the plans
presented at the end of this chapter.
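The comparison described in this paragraph reduces to a single inequality, sketched below. The function name and the two pairs of error variances are hypothetical, invented purely to illustrate the rule; they are not data from any experiment in this chapter.

```python
def confounding_gains_precision(var_small_blocks, var_large_blocks, relative_information):
    """True when a partially confounded comparison is estimated more precisely
    with the small incomplete blocks than with complete replicates as blocks."""
    return var_small_blocks / var_large_blocks < relative_information

# Hypothetical error variances per unit (blocks of 6 versus blocks of 18 units).
print(confounding_gains_precision(180.0, 230.0, 7 / 8))   # ratio about 0.78
print(confounding_gains_precision(210.0, 230.0, 7 / 8))   # ratio about 0.91
```

With relative information 7/8, the smaller blocks must cut the error variance by more than one-eighth before the AB interaction gains from confounding.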

The increase in the amount of computation varies both with the type
of design and with the type of analysis needed to summarize the results.
Where all the confounding is complete, the statistical analysis is
practically the same as with randomized blocks. With all factors at two
or three levels (i.e., in the 2^n and 3^n series), the extra
computations are not formidable even when partial confounding is used;
they are much more troublesome when the number of levels is not the same
for all factors, as in the 3 X 2^2 and 3 X 2^3 designs. The extra work
consists in the calculation of adjustments to the partially confounded
comparisons so that their sums of squares in the analysis of variance
are freed from block effects. If it is desired to present tables of mean
yields which involve confounded comparisons, the entries in the tables
must be adjusted so as to remove block effects. This point should be
remembered, especially with designs where two-factor interactions are
confounded. If t-tests of simple effects are needed, special
computations must be made to obtain the standard errors for these tests.
Moreover, if part of the data is missing, the analysis tends to become
much more complicated than with randomized blocks.

The experimenter who has not previously used confounding and who
does not have access to expert statistical advice is advised to confine
himself at first to the simplest types of confounding. He should make sure
that he understands the method of analysis before the experiment is
conducted. After the experiment is completed, he should investigate the
increase in precision that has been achieved by confounding, as a guide to
the future use of the technique.

Yates (6.5) has pointed out that in certain circumstances a confounded
design might give rise to a misleading interpretation of the results. This
can be illustrated by means of the 2^3 factorial in table 6.1 (p. 154). This
has three replicates, with ABC completely confounded in each replicate.
Suppose that the response to A is much larger than any other effects,
and that this response varies greatly from block to block. That is, the
interaction of A with blocks is much larger than the interactions of the
other factors with blocks. Now the factorial effect total for BC may be
written

BC = (abc) + (bc) + (a) + (1) - (ab) - (ac) - (b) - (c)

   = [(abc) + (a) - (b) - (c)] - [(ab) + (ac) - (bc) - (1)]

   = response to A in blocks 1, 3, 5 - response to A in blocks 2, 4, 6

The point is that the factorial effect of BC is a component of the
interaction of A with blocks. When tested against the error m.s. (12 d.f.)
this response might be significant, not because there is an effect of BC,
but because A has large interactions with blocks.
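
The identity between the BC contrast and the block-to-block difference in the response to A can be checked mechanically. The sketch below is only an illustration: the helper names are hypothetical, and the sign rule is the usual one for the 2^n series (a combination gets a minus sign when an odd number of the effect's letters are absent from it).

```python
from itertools import combinations

# The eight treatment combinations of a 2^3 factorial; "" stands for (1).
TREATMENTS = ["".join(c) for r in range(4) for c in combinations("abc", r)]

def sign(effect, combo):
    """Coefficient (+1 or -1) of a treatment combination in a factorial effect."""
    absent = sum(1 for letter in effect if letter not in combo)
    return -1 if absent % 2 else 1

def bc_is_block_difference_of_a():
    """Check BC = (response to A in one block) - (response to A in the other)."""
    for t in TREATMENTS:
        # +1 for the block holding abc, a, b, c; -1 for the block holding ab, ac, bc, (1)
        block = sign("abc", t)
        if sign("bc", t) != block * sign("a", t):
            return False
    return True
```

Calling `bc_is_block_difference_of_a()` confirms that, coefficient by coefficient, the BC contrast is the A contrast with its sign reversed in the second block.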

Another way of describing the source of the trouble is to say that the
error m.s. is not homogeneous, in that the interaction of A with blocks
is larger than the interactions of other treatments. If this were
suspected, the danger of a misleading interpretation could be avoided by
subdividing the error m.s. We can isolate 4 d.f. (two from blocks 1, 3,
and 5 and two from blocks 2, 4, and 6) that represent the interaction of
A with blocks. The mean square for these four could be used to test A
and BC, while the remaining factorial effects would be tested by the other
8 d.f. for error. This device can usually be applied in the 2^n series, though
generally not in the mixed series. Further, the heterogeneity of the
error variance may not be suspected.

How frequently this danger arises will depend on the nature of the 
experimentation. Investigations by Yates (6.5) and Kempthorne (6.9)
suggest that there is little danger with fertilizer experiments on English 
farm crops. In lines of research where errors have often been found to be 
heterogeneous, one should be on the alert for this effect. 

In some experiments the use of confounding is practically unavoidable. 
For instance, an experiment on methods of manufacturing ice cream, 
planned at North Carolina State College, contained 3 factors each at 3 
levels. It was desired to discover whether ordinary consumers could 
detect differences in the palatability of the 27 different types of ice cream. 
From previous experience it was believed that a taster (who was not an 
expert judge) could compare at most 3 ice creams at any single test. 
Moreover, it did not seem feasible to ask the tasters to grade the pal- 
atabilities on any kind of standard scale: they could merely be asked to 
rank the 3 ice creams in order of preference, if they had any preferences. 
Consequently, a design for a 3^3 factorial in blocks of 3 units was
required, even though this involved confounding of two-factor interactions.
The design adopted also employed partial confounding of the main ef- 
fects, so as to obtain approximately equal accuracy on main effects and 
two-factor interactions. 
TABLE 6.19 Index to plans of factorial experiments confounded in
randomized incomplete blocks

a. Designs with which any number of replicates may be used

            Number of    Number of
Type        treatments   units per   Interactions confounded in a single replicate *   Plan
                         block
2^3              8           4       ABC                                               6.1
2^4             16           8       ABCD                                              6.2
2^4             16           4       AB, ACD, BCD                                      6.4
2^5             32           8       ABC, ADE, BCDE                                    6.5
2^6             64          16       ABCD, ABEF, CDEF                                  6.3
2^6             64           8       ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF              6.6
3^3             27           9       ABC (3/4)                                         6.7
3^4             81           9       ABC, ABD, ACD, BCD, all (3/4)                     6.8
4^2             16           4       AB (2/3)                                          6.12
4 X 2^2         16           8       ABC (2/3)                                         6.13

* The fractions in parentheses give the relative information on the comparisons
which are confounded. Where no fraction is given, the comparison is completely
confounded.

b. Balanced designs

                                     Number of
            Number of    Units per   replicates for a   Interactions confounded and relative
Type        treatments   block       balanced design    information (in parentheses) *                   Plan
2^4             16           4       6n †               All two-factor (5/6); all three-factor (1/2)     6.4
2^5             32           8       5n                 All three-factor (4/5); all four-factor (4/5)    6.5
2^6             64           8       10n                All three-factor (4/5); all four-factor (4/5)    6.6
3^3             27           9       4n                 All three-factor (3/4)                           6.7
3^4             81           9       4n                 All three-factor (3/4)                           6.8
3 X 2^2         12           6       3n ‡               BC (8/9), ABC (5/9)                              6.9
3 X 2^3         24           6       3n ‡               BC, BD, CD, all (8/9); ABC, ABD, ACD, all (5/9)  6.10
3^2 X 2         18           6       4n                 AB (7/8), ABC (5/8)                              6.11
4^2             16           4       3n                 AB (2/3)                                         6.12
4 X 2^2         16           8       3n                 ABC (2/3)                                        6.13
4 X 3 X 2       24          12       9n ‡               AC [illegible], ABC [illegible]                  6.14

* The factors ABC ... are read from the left; thus the BC interaction in a 3 X 2^2
design is the interaction between the 2 factors at 2 levels.

† The symbol "6n" denotes that the number of replicates should be a multiple of 6.
‡ In these cases only the balanced design is recommended.
6.22 Index to Experimental Plans. The opportunities for confounding
vary with the number of factors and with the number of levels of each
factor. The plans given at the end of this chapter are those that are 
likely to be most frequently used. 

Table 6.19, which forms an index to the plans, is divided into two parts. 
The first part shows a group of designs in which any number of replicates 
may be used. In successive replicates, the experimenter may either (i) 
repeat the first replicate (with a new randomization), in which case the 
same comparisons are confounded in all replicates, or (ii) change the 
grouping into blocks in successive replicates, so as to spread the con- 
founding over a greater number of comparisons. In the plans, the fac- 
torial effects that are confounded are shown separately for each replica- 
tion. Consequently, the reader can readily see what possibilities are 
open in this direction. Where confounding can be restricted to a high-order
interaction, as in the 2^4 design in blocks of 8 units, the first procedure
is preferable. On the other hand, with the 2^4 design in blocks of
4 units, where one two-factor interaction must be confounded in every
replicate, the confounding should be changed in successive replications,
unless it is certain that one of the two-factor interactions will be negligible.
More detailed recommendations for the individual designs are
given in section 6.3.

The second part, table 6.19b, shows a group of balanced designs, in
which all interactions of the same order and the same type are confounded
to the same extent. Thus in the 3 X 2 X 2 X 2 (or 3 X 2^3)
design, all first-order interactions between the factors which have 2 levels 
are equally confounded, the relative information being 8/9. The first- 
order interactions between the factor at 3 levels and any factor at 2 levels 
are unconfounded. Balanced designs are useful in cases where all inter- 
actions of the same order and type are of equal interest, so that it is un- 
desirable to confound some more fully than others. Also, the computa- 
tions are usually simpler. For this reason, only the balanced designs 
should be used in certain cases, as indicated in table 6.19. On the other 
hand, with balanced designs the available numbers of replications are 
severely limited. 

Table 6.20 shows a number of designs for which plans are given in the 
references cited. These designs are mainly for mixed series, and will not 
be discussed here. 

6.23 Randomization. The procedure is the same for all designs. 

1. Arrange the positions of the blocks at random within each replication,
using a new randomization for each replication.
2. Allot the treatments in any block at random to the units in the
block, using a new randomization for each block.
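
The two steps above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed procedure: the function name, the nested-list representation of a plan, and the `seed` parameter are all assumptions made for the example.

```python
import random

def randomize_plan(replicates, seed=None):
    """Randomize a confounded plan.

    replicates: list of replicates; each replicate is a list of blocks;
                each block is a list of treatment labels.
    Returns a new plan; the input is left unchanged.
    """
    rng = random.Random(seed)
    randomized = []
    for blocks in replicates:
        shuffled_blocks = [list(block) for block in blocks]
        rng.shuffle(shuffled_blocks)   # step 1: block positions within the replication
        for block in shuffled_blocks:
            rng.shuffle(block)         # step 2: treatments within each block
        randomized.append(shuffled_blocks)
    return randomized

# A 2^3 factorial with ABC confounded, repeated in two replications.
plan = [
    [["(1)", "ab", "ac", "bc"], ["a", "b", "c", "abc"]],
    [["(1)", "ab", "ac", "bc"], ["a", "b", "c", "abc"]],
]
field_layout = randomize_plan(plan, seed=1)
```

Because a single generator is advanced through the whole plan, each replication and each block receives its own fresh randomization, as the procedure requires.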


TABLE 6.20 Confounded designs for other factorial experiments

              Units per   Number of
Type          block       replicates   Interactions confounded                       Reference
3^2 X 2^2        12          2n        AB (7/8), ABCD                                6.6
3^3 X 2          18          2n        ABC, ABCD                                     6.6
3^3 X 2           6          2n        AB, AC, BC [illegible], ABD, ACD, BCD, ABC    6.1
4 X 3^2          12          2n        BC [illegible], ABC                           6.6
4^2 X 2          16          Any       ABC                                           6.6
[illegible]      12          3n        AB [illegible], ABC                           6.6
4^3              16          Any       ABC                                           6.7
4 X 2^3           8          3n        ABC, ABD, ACD                                 6.6
4^4              16          Any       ABC, ABD, ACD, BCD                            6.7
5^2               5          4n        AB (3/4)                                      6.8
5 X 2^2          10          5n        BC [illegible], ABC                           6.6
5^3              25          Any       ABC                                           6.8


6.24 Experiments in Single Replication. In most of the plans at the 
end of this chapter, the total number of treatment combinations is large. 
This fact limits the number of replications that can be employed. Some- 
times the resources are sufficient for only a single replication unless the 
experimenter omits some factors which he originally intended to include. 

With only one replication, we cannot derive an estimate of error from 
the interactions of treatments with blocks. If, however, certain high- 
order interactions are negligible, their mean squares in the analysis of 
variance will behave exactly like components of the error m.s., and therefore
can be used to provide an estimate of error. In the 2^n series, the
smallest design for which a single replicate is likely to be used is the 2^5 in
blocks of 8 units (plan 6.5). In replicate 1 of this plan, ABC, ADE, and
BCDE are completely confounded. If the remaining three-, four-, and
five-factor interactions furnish the estimate of error, the analysis of
variance appears as follows.

                                         d.f.
Blocks                                     3
Main effects                               5
Two-factor interactions                   10
Error (from high-order interactions)      13
                                          --
Total                                     31
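
The degrees of freedom in this table can be recovered by enumerating the 31 factorial effects. The following sketch assumes only what the text states (ABC, ADE, and BCDE confounded; all remaining interactions of three or more factors used as error); the function name and dictionary keys are hypothetical.

```python
from itertools import combinations

def single_replicate_df(factors, confounded):
    """Split the d.f. of a single-replicate 2^n design with each effect worth 1 d.f."""
    effects = ["".join(c) for r in range(1, len(factors) + 1)
               for c in combinations(factors, r)]
    free = [e for e in effects if e not in confounded]
    return {
        "blocks": len(confounded),
        "main effects": sum(1 for e in free if len(e) == 1),
        "two-factor interactions": sum(1 for e in free if len(e) == 2),
        # all unconfounded interactions among three or more factors
        "error": sum(1 for e in free if len(e) >= 3),
        "total": len(effects),
    }

table = single_replicate_df("ABCDE", {"ABC", "ADE", "BCDE"})
```

Of the 16 interactions among three or more factors (10 + 5 + 1), three are confounded with blocks, which leaves the 13 d.f. shown for error.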
Other designs in which a sufficient number of error d.f. are available
from interactions among three or more factors are the 2^6, the 3^3, and the
3^4. The interactions that constitute the estimate of error should be
chosen before the results have been inspected. Recommendations for
each of the designs above are given in the notes on the plans (section
6.3). It may sometimes be wise to deviate from these recommendations.
For instance, in the 2^5 example above, the main effects of A, C, and D
and the AC and AD interactions might all turn out to be rather large.
This would suggest that the ACD interaction may not be negligible, and
that it should not be included in the interactions used as error. It will
be realized that a decision of this type should be made before examining 
ACD itself. The practice of examining the high-order interactions and 
using those that are small as error leads to a serious underestimation of 
the true error variance. 

The estimation of error from high-order interactions is, of course, open 
to criticism. If some of these interactions happen to be large, the error 
m.s. that is used will overestimate the true error variance and the fact 
that the interactions are large will not be discovered. Where a consid- 
erable series of experiments of the same general type is being conducted, 
some safeguard is obtained both by examining the high-order interactions 
in experiments that are replicated and by watching the two-factor inter- 
actions in experiments with single replication. If most two-factor inter- 
actions are small, it seems very unlikely (though still possible) that inter- 
actions of higher order will be large. On the other hand, if many two- 
factor interactions are found to be large, this suggests that some three- 
factor interactions may also be substantial For further discussion, see 
Yates (6.5) and Cornish (6.10). 

6.25 Experiments with Fractional Replication. More recently several
writers, Finney (6.11) and (6.12), Plackett and Burman (6.13), and
Kempthorne (6.14), have considered the use of only part of a replication
in a factorial experiment. Not enough experience has accumulated to
permit appraisal of the utility of these designs, and only an introduction
to them will be given. They are most likely to prove useful where (i) a
large number of factors is tested, so that even a single replication is not
feasible, and (ii) all high-order interactions are known with confidence to
be negligible.

In order to see what happens when the experiment contains only part
of a replication, consider a 2^3 factorial in which only the 4 treatment
combinations a, b, c, and abc are tested. This is half of a complete
replicate. It is clear that the three-factor interaction ABC cannot be
estimated at all, since this represents the difference between the 4 treatments
that are tested and 4 that have not been tested. Finney (6.12) has called
this comparison the defining contrast, since it defines the two halves into
which the replicate is divided. Further, the main effect of A, as
estimated from the difference between units with a and units without a; i.e.,

(abc) + (a) - (b) - (c)

is exactly the same comparison as the BC interaction, as estimated from
the difference between units that receive an even number of b's or c's and
those that receive an odd number. Similarly, the estimate of B is the
same as that of AC, while C is the same as AB. Factorial effects that are
represented by the same treatment comparison are called aliases; thus
BC is an alias of A. It has been shown (6.12) that the alias of any effect
is its generalized interaction with the defining contrast. For example,
the interaction of A with ABC is A^2BC or BC. This result makes it
easy to discover the alias of any factorial effect.
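
For the 2^n series the rule amounts to cancelling every squared letter, that is, keeping the letters that occur in exactly one of the two effects. A minimal Python sketch follows; the function names are hypothetical and the code applies to two-level factors only.

```python
def generalized_interaction(effect_1, effect_2):
    """Product of two effects of the 2^n series with squared letters cancelled."""
    # Letters present in exactly one of the two effects survive.
    letters = set(effect_1) ^ set(effect_2)
    return "".join(sorted(letters))

def alias(effect, defining_contrast):
    """Alias of an effect in a half-replicate with the given defining contrast."""
    return generalized_interaction(effect, defining_contrast)

# The half-replicate of the 2^3 factorial discussed above.
aliases_2_cubed = {effect: alias(effect, "ABC") for effect in ("A", "B", "C")}
```

Here `aliases_2_cubed` pairs A with BC, B with AC, and C with AB, in agreement with the text.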

If the experiment shows an apparent effect of A, there is no way of
telling from the results whether the effect was due to A, or to the BC
interaction, or to a mixture of the two. This type of information is too
vague to be satisfactory. If, however, it can be taken for granted that 
all two-factor interactions are negligible, the ambiguity disappears, since 
we would conclude that an apparent effect of A was in fact due to A. 
With these assumptions, the half-replicate enables us to estimate the 
three main effects. 

This example brings out the fact that in order to obtain definite in- 
formation, it is necessary to assume that certain interactions are non- 
existent. The designs discussed by Finney and Kempthorne are adapted 
for situations where all interactions among three or more factors are 
negligible. In those presented by Plackett and Burman, even two-factor 
interactions are ignored. 

We will examine the construction of a half-replicate of the 2^6 factorial,
since this is probably the smallest member of the 2^n series for which
fractional replication might be needed in practice. It is desired to obtain
information on both main effects and two-factor interactions. First
divide the treatment combinations into two sets of 32 by means of the
defining contrast ABCDEF. Either set may be chosen as the half-replicate.
The factorial effects are shown in table 6.21.

It will be noted that, rather fortunately, every alias of a main effect 
or two-factor interaction is a high-order interaction of a type that is as- 
sumed non-existent. The alias of a three-factor interaction is another 
three-factor interaction. Since all three-factor interactions are assumed 
negligible, these components may be used to estimate error. Conse- 
quently, this plan, if set out as a completely randomized design in 32 
units, enables all main effects and two-factor interactions to be estimated
and provides 10 d.f. from three-factor interactions for the estimation of 
error. 

TABLE 6.21 Structure of a 2^6 factorial (half-replicate)

Main effects        Alias      Main effects        Alias
A                   BCDEF      D                   ABCEF
B                   ACDEF      E                   ABCDF
C                   ABDEF      F                   ABCDE

Two-factor                     Two-factor
interactions        Alias      interactions        Alias
AB                  CDEF       BF                  ACDE
AC                  BDEF       CD                  ABEF
AD                  BCEF       CE                  ABDF
AE                  BCDF       CF                  ABDE
AF                  BCDE       DE                  ABCF
BC                  ADEF       DF                  ABCE
BD                  ACEF       EF                  ABCD
BE                  ACDF

Three-factor                   Three-factor
interactions                   interactions
(used as error)     Alias      (used as error)     Alias
ABC                 DEF        ACE                 BDF
ABD                 CEF        ACF                 BDE
ABE                 CDF        ADE                 BCF
ABF                 CDE        ADF                 BCE
ACD                 BEF        AEF                 BCD

Since a completely randomized design is rather inaccurate for many
types of experiment, an arrangement in blocks of 16 or 8 units may be
preferable. If the 32 units are divided into two groups of 16, one factorial
effect and its alias are confounded with groups. The best choice is
one of the aliases from the set used as error, e.g., ABC and DEF. The
plan is easily constructed. Suppose that the set of 32 which contains
abcdef has been chosen as the half-replicate. This is divided into two
groups of 16 such that one group (block 1) contains all treatment
combinations carrying an even number of the letters a, b, or c. The plan and
analysis are shown in table 6.22.
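
The construction just described can be sketched as follows. The function names are hypothetical; the only facts used are those stated in the text: the chosen half is the one containing abcdef (the combinations with an even number of letters), and block 1 holds the combinations with an even number of the letters a, b, c.

```python
from itertools import combinations

def half_replicate(factors="abcdef"):
    """Treatment combinations with an even number of letters; "" stands for (1)."""
    return ["".join(c) for r in range(0, len(factors) + 1, 2)
            for c in combinations(factors, r)]

def split_by_parity(treatments, letters):
    """Return (even, odd): combinations with an even or odd count of the given letters."""
    even = [t for t in treatments if sum(ch in letters for ch in t) % 2 == 0]
    odd = [t for t in treatments if sum(ch in letters for ch in t) % 2 == 1]
    return even, odd

half = half_replicate()
block_1, block_2 = split_by_parity(half, "abc")
```

Each block receives 16 of the 32 combinations, and the comparison between the two blocks is the ABC contrast (equivalently its alias DEF).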

TABLE 6.22 A half-replicate of a 2^6 factorial in blocks of 16 units

Block 1    Block 2
(1)        ad
de         ae
df         af
ef         bd
ab         be                                                     d.f.
ac         bf          Blocks                                       1
bc         cd          Main effects                                 6
abde       ce          Two-factor interactions                     15
abdf       cf          Error (from three-factor interactions)       9
abef       adef                                                    --
acde       bdef        Total                                       31
acdf       cdef
acef       abcd
bcde       abce
bcdf       abcf
bcef       abcdef

Note that the error d.f. are reduced from 10 to 9, since 1 d.f. has been assigned to
blocks.

For a design in blocks of 8 units, we must confound with blocks another
alias pair from the comparisons that are used as error. Suppose
that ABD and its alias CEF are selected. It will be recalled (section
6.13) that the generalized interaction of this pair with ABC and DEF is
also confounded. By the usual rule, this interaction is the alias pair CD
and ABEF. It is unfortunate that a two-factor interaction is confounded,
but further trial shows that this is unavoidable in blocks of 8
units. To construct the plan (table 6.23), divide each block of table 6.22
into halves that contain respectively an odd and even number of the
letters c and d.
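
A short sketch of this second subdivision is given below. The names are hypothetical, and the numbering of the four groups by a pair of parities is a convenience of the example rather than the block numbering of the printed plan.

```python
from itertools import combinations

def parity(combo, letters):
    """0 if the combination carries an even number of the given letters, else 1."""
    return sum(1 for ch in combo if ch in letters) % 2

def generalized_interaction(effect_1, effect_2):
    """Product of two effects of the 2^n series with squared letters cancelled."""
    return "".join(sorted(set(effect_1) ^ set(effect_2)))

def blocks_of_8():
    """Split the half-replicate containing abcdef by parity on (a, b, c) and on (c, d)."""
    half = ["".join(c) for r in (0, 2, 4, 6) for c in combinations("abcdef", r)]
    blocks = {}
    for t in half:
        blocks.setdefault((parity(t, "abc"), parity(t, "cd")), []).append(t)
    return blocks

blocks = blocks_of_8()
third_confounded = generalized_interaction("ABC", "ABD")  # the two-factor interaction lost
```

The result gives four blocks of 8, and `third_confounded` is CD, whose alias under the defining contrast ABCDEF is ABEF.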

TABLE 6.23 A half-replicate of a 2^6 factorial in blocks of 8 units

              Block
1        2        3         4
(1)      de       ae        ad                                      d.f.
ef       df       af        bd         Blocks                         3
ab       ac       be        ce         Main effects                   6
abef     bc       bf        cf         Two-factor interactions       14
acde     abde     cd        abce       Error                          8
acdf     abdf     abcd      abcf                                     --
bcde     acef     cdef      adef       Total                         31
bcdf     bcef     abcdef    bdef




In practice, the letters should be allocated to the factors so that CD 
is the two-factor interaction that is considered least important. In the 
analysis of variance, the sum of squares for blocks, main effects, and the 
unconfounded two-factor interactions are all computed in the usual way. 

Other experiments for which plans are described in reference (6.12) are
a half-replicate of a 2^7 factorial in blocks of 8 units, a half-replicate of a
2^8 factorial in blocks of 16, a quarter-replicate of a 2^8 factorial in blocks
of 16, and a third-replicate of a 3^5 in blocks of 9.
For the case where two-factor interactions are ignored, Plackett and
Burman (6.13) give methods for constructing a large number of plans in
which all main effects can be estimated. The only restriction is that the
number of units tested is a multiple of 4. For instance, there is a plan
for a 2^7 factorial, using only 8 units (one-sixteenth of a replicate). All
seven main effects can be obtained, though no comparisons remain for
an estimate of error. There are further plans in 12, 16, 20, etc., units,
giving 4, 8, 12, etc., d.f. for error. All these plans are set out for a
completely randomized design: with further confounding a reduced block
size could be employed. As an illustration the plan with 16 units is given
below. To facilitate study of this plan, the treatment combinations are
shown both in our usual notation and by means of + and - signs to
denote the two levels of each factor.

Every factor occurs 8 times at each level. For every pair of factors,
each of the sign combinations, ++, +-, -+, and --, occurs four
times. Thus the estimates of all seven main effects are mutually orthogonal,
and are calculated in the usual way. With complete randomization,
8 d.f. are available for the estimation of error.


TABLE 6.24 2^7 factorial in 16 units (one-eighth replicate)

                     Treatment combination
        Our          + and - notation
Unit    notation     a  b  c  d  e  f  g
 1      (1)          -  -  -  -  -  -  -
 2      dg           -  -  -  +  -  -  +
 3      cfg          -  -  +  -  -  +  +
 4      cdf          -  -  +  +  -  +  -
 5      bef          -  +  -  -  +  +  -
 6      bdefg        -  +  -  +  +  +  +
 7      bceg         -  +  +  -  +  -  +
 8      bcde         -  +  +  +  +  -  -
 9      ae           +  -  -  -  +  -  -
10      adeg         +  -  -  +  +  -  +
11      acefg        +  -  +  -  +  +  +
12      acdef        +  -  +  +  +  +  -
13      abf          +  +  -  -  -  +  -
14      abdfg        +  +  -  +  -  +  +
15      abcg         +  +  +  -  -  -  +
16      abcd         +  +  +  +  -  -  -
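
The rows of table 6.24 can be regenerated and the balance property checked with a short script. The generating rule used here (a full 2^4 factorial in a, b, c, d, with e, f, g set by the parities of ab, bc, and cd) is an observation read off the printed rows; it is an assumption of this sketch and is not claimed to be the construction given by Plackett and Burman.

```python
from collections import Counter
from itertools import combinations, product

def design_16_units():
    """Rows of 0/1 levels for factors a..g, in the unit order of table 6.24."""
    rows = []
    for a, b, c, d in product((0, 1), repeat=4):
        e, f, g = a ^ b, b ^ c, c ^ d   # assumed rule, inferred from the table
        rows.append((a, b, c, d, e, f, g))
    return rows

def label(row):
    """Usual notation: letters of the factors at the upper level, (1) if none."""
    return "".join(l for l, level in zip("abcdefg", row) if level) or "(1)"

def is_balanced(rows):
    """True if every pair of factors shows each of the 4 level combinations 4 times."""
    for i, j in combinations(range(7), 2):
        counts = Counter((r[i], r[j]) for r in rows)
        if sorted(counts.values()) != [4, 4, 4, 4]:
            return False
    return True

rows = design_16_units()
labels = [label(r) for r in rows]
```

`labels` reproduces the "our notation" column of the table, and `is_balanced(rows)` confirms the pairwise balance on which the orthogonality of the seven main effects rests.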
Since it is not clear how frequently all factors can be assumed to be
additive in their effects, the utility of these designs remains to be seen.
As Yates (6.5) has pointed out, one case where there is additivity occurs
when each factor represents an object to be weighed. Suppose that
weighings are done on a chemical balance. Testing the + level of a
factor corresponds to placing the object in the right-hand pan; testing
the - level corresponds to placing the object in the left-hand pan.

Thus the rows of table 6.24 provide a set of instructions for perform- 
ing 16 successive weighings on 7 objects. In the first weighing, all ob- 
jects are placed in the left-hand pan; consequently the weight that is 
read is the total weight of all 7 objects. In the second weighing objects 
d and g are placed in the right-hand pan and the others in the left-hand 
pan, and so on. This series of 16 weighings enables us to estimate the 
weight of each object as precisely as if all weighings had been devoted to 
that object alone. To make the analogy complete, a balance with a zero 
error would be needed, the zero error corresponding to the mean yield of 
an experiment, which can also be estimated from the 16 results. For a 
discussion and references, see Mood (6.15). 
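
The weighing analogy can be illustrated numerically. In the sketch below the object weights and the zero error are hypothetical, the readings are taken without random error, and the sign convention (a reading equals the zero error plus the weights in the left-hand pan minus those in the right-hand pan) is an assumption chosen to agree with the description of the first weighing.

```python
from itertools import product

def design_signs():
    """The 16 rows of table 6.24 as -1/+1 levels for objects a..g."""
    rows = []
    for a, b, c, d in product((-1, 1), repeat=4):
        # e, f, g are at the + level when the two parent factors differ (assumed rule)
        rows.append((a, b, c, d, -a * b, -b * c, -c * d))
    return rows

def readings(weights, zero_error):
    """Error-free balance readings: - level = left-hand pan, + level = right-hand pan."""
    return [zero_error - sum(s * w for s, w in zip(row, weights))
            for row in design_signs()]

def estimate(ys):
    """Estimate each weight and the zero error from the 16 readings."""
    rows = design_signs()
    weights = [-sum(row[i] * y for row, y in zip(rows, ys)) / 16 for i in range(7)]
    zero_error = sum(ys) / 16
    return weights, zero_error

true_weights = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]   # hypothetical
ys = readings(true_weights, zero_error=0.5)
est_weights, est_zero = estimate(ys)
```

Each estimate uses all 16 readings, which is why, when the readings carry independent errors of equal variance, every weight is estimated with the variance of a mean of 16 weighings.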

6.3 Notes on the Plans and Statistical Analysis 

6.31 2^n Series. In plans 6.1 to 6.6, one level of a factor is denoted
by the corresponding letter; the other level by the absence of the letter.
With designs that are not balanced, care must be taken to allocate the
letters a, b, c, ... to the factors in the experiment so that the interactions
which are confounded are those that the experimenter has decided to
confound. For instance, the first replication of plan 6.4 (2^4 design in
blocks of 4) confounds the interaction AB, but no other two-factor
interaction. The factors labelled A and B should be those whose interaction
is of least interest.

The statistical analysis follows the same general method for all 2^n
designs. The total s.s. and the sum of squares among blocks are obtained
by the usual methods. The factorial effects may be computed by one of
the methods outlined in section 6.14. When this calculation has been
completed, the sums of squares are added for all unconfounded comparisons.
Let this total be S_T. If all the confounding is complete, S_T is the
treatment s.s., since the effects that are confounded appear already
in the block s.s. For each comparison which is partially confounded, the
sum of squares is calculated separately from those replications in which the
comparison is unconfounded. The replications to be used are clear from
the plans, which show the factorial effects that are confounded in each
replication. The total of these sums of squares is added to S_T to give
the treatment s.s. The error s.s. is found by subtraction.
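
The rule for a partially confounded comparison can be sketched as follows. The yields are hypothetical and the function names are illustrative; the sum of squares is taken as the square of the effect total divided by the number of units entering it, which is the usual formula for a contrast with coefficients of +1 and -1.

```python
def contrast_sign(effect, combo):
    """Coefficient (+1 or -1) of a treatment combination in a factorial effect."""
    absent = sum(1 for letter in effect if letter not in combo)
    return -1 if absent % 2 else 1

def effect_ss(replicates, effect, use=None):
    """Sum of squares (1 d.f.) for an effect, from selected replications only.

    replicates: list of dicts mapping a treatment combination ("" for (1)) to its yield
    use: indices of the replications in which the effect is unconfounded (None = all)
    """
    chosen = [rep for i, rep in enumerate(replicates) if use is None or i in use]
    total = sum(contrast_sign(effect, t) * y for rep in chosen for t, y in rep.items())
    n_units = sum(len(rep) for rep in chosen)
    return total ** 2 / n_units

# Hypothetical yields for two replications of a 2^2 factorial.
reps = [{"": 1.0, "a": 3.0, "b": 2.0, "ab": 4.0},
        {"": 1.0, "a": 3.0, "b": 2.0, "ab": 4.0}]
ss_a_all = effect_ss(reps, "a")             # effect unconfounded in both replications
ss_a_first = effect_ss(reps, "a", use={0})  # effect taken from replication 1 only
```

An unconfounded effect is computed with `use=None`; a partially confounded one is computed with `use` set to the replications in which it is clear of blocks.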

2^3 factorial, blocks of 4 units. Plan 6.1, which confounds ABC completely,
may be used for every replication. For a worked example, see
reference (6.1), p. 19.

Alternatively, with 4, or a multiple of 4, replicates, we may confound 
partially all two-factor interactions and the three-factor interaction, the 
relative information on each comparison being 3/4. The plan and sta- 
tistical analysis are described in reference (6.1), p. 21. 

2^4 factorial, blocks of 8 units. Plan 6.2 should be used for every replication,
ABCD being completely confounded.

Note that the treatment s.s. contains only 14 d.f. since the single de- 
gree of freedom for ABCD is included in the blocks. 

2^4 factorial, blocks of 4 units. Plan 6.4 gives a balanced design with 6
replications. If a smaller number of replicates is used, the replicates
chosen depend on the interactions which the experimenter decides to
confound. Since one two-factor interaction is confounded in each replicate,
it will usually be preferable to confound different comparisons in
successive replications. For instance, if AB and CD are to be confounded,
replicates 1 and 6 should be selected. This arrangement gives
1/2 relative information on AB, CD and the three-factor interactions
ACD, BCD, ABC, ABD, all other comparisons being clear of blocks.

With a 2^4 experiment, the investigator may be uncertain whether to
confound in blocks of 8 units or in blocks of 4 units. The former design
is recommended unless (a) the interactions which are confounded with
blocks of 4 units are known to be small or (b) the block of 4 units is much
more homogeneous than the block of 8 units. The latter condition may
apply where the experimental material is extremely variable, or falls
naturally into groups of 4 units.

2^5 factorial, blocks of 8 units. The balanced design (plan 6.5) requires
5 replicates. When fewer than 5 replicates are used, it may be satis- 
factory to confound completely two three-factor interactions and one 
four-factor interaction. In this case replication 1 is repeated the re- 
quired number of times. 

If some information is desired on all treatment comparisons, replications
1, 2, ... may be used up to the proposed number. With replications
1 and 2, for instance, the interactions ABC, ADE, ABD, BCE, BCDE,
and ACDE are partially confounded, each with relative information 1/2.

The statistical analysis is described in reference (6.1), p. 27, for the 
case in which the experiment contains only a single replication. 

2^6 factorial, blocks of 16 units. Plan 6.3, which shows a single replication
with ABCD, ABEF, and CDEF confounded, may be used for all
replications. The 3 interactions above are completely confounded, and
are omitted from the treatment s.s. in the analysis of variance.

If there is only 1 replication, an estimate of error may be obtained 
from the unconfounded four-, five-, and six-factor interactions. These 
comparisons supply 22 d.f., of which three are confounded with blocks, 
leaving 19 for the estimate of error. The separation of the 63 d.f. is then 
as follows: 

                                         d.f.
Blocks                                     3
Main effects                               6
Two-factor interactions                   15
Three-factor interactions                 20
Error (from high-order interactions)      19


2^6 factorial, blocks of 8 units. In plan 6.6, the size of block is reduced
from 64 to 8 units without confounding any two-factor interactions. 
Four of the 20 three-factor interactions and 3 of the 15 four-factor inter- 
actions are confounded in each replication. The balanced design requires 
10 replications, i.e., 640 experimental units, and necessitates a large ex- 
periment. 

With a single replication, the 63 d.f. in the analysis of variance sub- 
divide as follows: 

                                         d.f.
Blocks                                     7
Main effects                               6
Two-factor interactions                   15
Three-factor interactions                 16
Error (from high-order interactions)      19


The simplest procedure when there is more than one replication is to 
confound completely a single set of 7 factorial effects by the repeated use 
of replication 1. For partial confounding, the successive replicates in 
plan 6.6 may be taken. In the first 4 replicates no three-factor inter- 
action is confounded more than once. 

6.32 3^n Series. From plan 6.7 onwards, the levels or variations of
each factor are denoted by the figures 0, 1, 2, ..., while the factors are
read from left to right. The treatment 102 implies the middle level of
the first factor, the lowest level of the second factor, and the highest
level of the third factor. In an experiment the numbers 0, 1, 2, ... may
be assigned to the levels or variations of a factor according to any
convenient scheme.

The testing of three instead of two variations of each factor makes
possible a more thorough evaluation of the effects of the factors. In
particular, the 3^3 factorial experiment has been much used in investigations
of the responses to fertilizers, since the effects of each of 3 different
plant nutrients may be explored at 3 levels of application, which provide
some information on the shape of the response curve.

3^3 factorial, blocks of 9 units. In plan 6.7, two of the 8 d.f. for ABC are
completely confounded in each replication, so that the balanced design 
requires 4 replicates and provides 3/4 relative information on ABC. 
Unless the three-factor interactions are known to be negligible, in which 
case replication 1 may be used for all replicates of the experiment, it is 
preferable to select the required number of replications in succession 
from the plan. For example, an experiment with 3 replicates gives 2/3 
relative information on 6 of the 8 interaction d.f., and full replication on 
the remaining 2 d.f. As usual, the rule in the analysis of variance is to 
estimate confounded comparisons from those replications in which they 
are not confounded. An example of the analysis with 4 replicates has 
already been given (section 6.17). 

For an example of the analysis with a single replicate, see reference 
(6.1), p. 53. If the three-factor interactions are used as error, the separa- 
tion of the degrees of freedom is as shown on the left in table 6.25. 

TABLE 6.25 Subdivision of the degrees of freedom in a $3^3$ experiment 
with single replication 

I. Three-factor interactions       II. For experiments where all 
   used as error                       effects are approximately linear 

                          d.f.                                 d.f. 
Blocks                      2      Blocks                        2 
Main effects                6      Main effects                  6 
Two-factor interactions    12      AB: linear × linear           1 
Error                       6      AC: linear × linear           1 
                                   BC: linear × linear           1 
                                   Error                        15 
Total                      26      Total                        26 



The number of error d.f., 6, is rather small. There is an alternative 
scheme for experiments where the effects of the factors are linear, or 
nearly so. Under these circumstances, the only components of the two- 
factor interactions that are likely to be substantial are the interactions of 
the linear responses to the factors. Hence, nine of the 12 d.f. for two- 
factor interactions are added to the material used for an estimate of error, 
as shown in table 6.25, II. If in addition the linear components of 
the two-factor interaction are large, a further variant is to remove from 
the error the ABC: linear × linear × linear term. This leaves 14 d.f. 
for error. The calculation of these components should be fairly simple 
with the help of the example worked in section 6.17. 

$3^4$ factorial, blocks of 9 units. With 4 factors each at 3 levels (plan 6.8) 
the block size is reduced from 81 to 9 units by confounding three-factor 
interactions. There are 4 three-factor interactions: ABC, ABD, ACD, 
and BCD, each with 8 d.f. Two components from each set of 8 are 
confounded in any replication. If the experiment has more than 1 replicate, 
partial confounding is generally preferable, the required number of 
replicates being selected from the balanced set. The balanced design, with 
4 replicates, gives threefold replication on all three-factor interactions. 

Statistical analysis with a single replication. An analysis may be 
calculated without difficulty from the following partition of the degrees 
of freedom. 

                                                    d.f. 
Blocks                                                8 
Main effects                                          8 
Two-factor interactions                              24 
Error (from three- and four-factor interactions)     40 


All sums of squares are calculated in the usual way. For the main effects 
and two-factor interactions, six 3 × 3 tables are required. The error, 
which is found by subtraction, contains 24 d.f. representing unconfounded 
three-factor interactions and the 16 d.f. for the four-factor 
interactions. 

This analysis gives no test of the three-factor interactions. If a test 
of ABC, for example, is wanted, the first step is to calculate in the usual 
way the sum of squares for the 8 d.f. From this we must subtract the 
sum of squares for the 2 d.f. that are confounded with blocks. 

In plan 6.8, replication I, the 2 d.f. are denoted by ABC IV, where the 
numeral IV refers to replication IV of plan 6.7. If we ignore D, it will 
be found that blocks 1, 6, and 8 contain the same set of 9 treatments as 
block IVa in plan 6.7. Calculate the total yield of these 3 blocks (27 
units). Similarly, we calculate the total yield of blocks 2, 4, and 9, which 
correspond to block IVb in plan 6.7, and of blocks 3, 5, and 7, which 
correspond to block IVc. The sum of squares of deviations of the 3 totals, 
divided by 27, is subtracted from the sum of squares for the 8 d.f. The 
resulting 6 d.f. are removed from the error, which now has 34 d.f. 
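
The subtraction described above can be sketched in a few lines of Python. This is only an illustration: the three totals and the 8 d.f. sum of squares below are hypothetical numbers, not data from the text, and the function name is our own.

```python
def confounded_ss(group_totals, units_per_total):
    """Sum of squares of deviations of the group totals from their mean,
    divided by the number of units making up each total."""
    mean_total = sum(group_totals) / len(group_totals)
    return sum((t - mean_total) ** 2 for t in group_totals) / units_per_total

# Hypothetical totals of blocks (1, 6, 8), (2, 4, 9) and (3, 5, 7),
# each total being a sum over 27 units.
totals = [540.0, 513.0, 567.0]
ss_confounded = confounded_ss(totals, 27)      # the 2 confounded d.f.

# Hypothetical sum of squares for all 8 d.f. of ABC.
ss_abc_8df = 300.0
ss_abc_unconfounded = ss_abc_8df - ss_confounded   # the remaining 6 d.f.
print(ss_confounded, ss_abc_unconfounded)
```

The same function applies to BCD III if the totals of blocks 1, 2, 3, blocks 4, 5, 6, and blocks 7, 8, 9 are supplied instead.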

The reader may verify that the 2 confounded degrees of freedom for 
BCD (i.e., BCD III) are obtained from the totals of blocks 1, 2, 3, blocks 
4, 5, 6, and blocks 7, 8, 9. In this case A is ignored when plan 6.8 is 
compared with plan 6.7. 

Statistical analysis with more than one replication. If all three-factor 
interactions are presumed to be negligible, they may be combined with 
the error. The sums of squares for blocks, main effects, and two-factor 
interactions are found in the usual way while the remainder constitutes 
the error. 

All 32 components of the three-factor interactions may, however, be 
calculated and tested. Partially confounded comparisons are taken only 
from replicates in which they are unconfounded. For example, if there 
are two replications of plan 6.8, all the data are used in the calculation of 
components ABC I and ABC II, which are unconfounded. For ABC III 
we use only the data from the first replicate of the experiment, and for 
ABC IV only the data from the second replicate. 

6.33 Mixed Series. $3 \times 2^2$ factorial, blocks of 6 units. Only the 
balanced plan (6.9) should be used; this may be repeated if extra 
replications are wanted. Thus the number of replicates must be 3, or some 
multiple of 3. Although the BC interaction is partially confounded, the 
relative information is high (8/9). 

The statistical analysis is described, with a numerical example, in 
reference (6.1), p. 58. 

$3^2 \times 2$ factorial, blocks of 6 units. In plan 6.11, AB and ABC are 
partially confounded. Either 2 or 4 replicates may be used, though 4 
are required for complete balance. A numerical example for 4 replicates 
was given in section 6.19. 

With 2 replicates (say I and II in plan 6.11) the I and R components 
are partially confounded, while the J and S components are unconfounded. 
The I and R totals are adjusted by the same formulae as in 
table 6.16, (iii) and (iv). The J and S totals need no adjustment. The 
interaction s.s. in the analysis of variance are computed as follows. 

AB{1) = MVh'f + (2 h’) 2 + (2i V) 3 ] - tfbWi' + 2 l 2 ' + 2J s f 
AB(J) - T Wi 2 + + / 3 2 ] - -hUi + J 2 + J s ? 

ABC(R ) = + (2 R 2 ') 2 + (2 ZV) 2 ] 

- Jjr [22V + 2 R 2 ' + 2R. s f 

ABC(S ) = tV [^ 2 + S 2 2 + S 3 2 ] - + S 2 + S3J 2 

The AB two-way table and the table of individual treatment means 
are adjusted in the same way as with four replicates. The quantities 2I', 
2R' are divided by 18 and 6, respectively, to obtain i' and r'. For i and 
r, we divide I and R each by 12. After taking deviations from the mean 
in each case, we calculate Si and Sr. Thereafter the adjustment proceeds 
exactly as with 4 replicates, being somewhat simpler since J and S 
components do not enter. 

$3 \times 2^3$ factorial, blocks of 6 units. In plan 6.10 the interactions BC, 
BD, and CD, between pairs of factors that occur at 2 levels, are partially 
confounded with 8/9 relative information, while 5/9 relative information 
is retained on the interactions ABC, ABD, and ACD. Only the balanced 
design, which requires 3 replicates, is recommended. 

The following computations lead to the sums of squares due to partially 
confounded effects. Let 

$$g_1 = I_a + I_b - I_c - I_d; \qquad g_1' = I_a - I_b + I_c - I_d; \qquad g_1'' = I_a - I_b - I_c + I_d$$

where $I_a$ is the total of block Ia, etc. Similar definitions hold for 
$g_2, \ldots$, in the other replications. Then find 

$$3Q = 3[CD] + g_1 + g_2 + g_3; \qquad 3Q' = 3[BD] + g_1' + g_2' + g_3'; \qquad 3Q'' = 3[BC] + g_1'' + g_2'' + g_3''$$

$$3R_0 = 3[CD_{a_0}] - g_1 + g_2 + g_3; \quad 3R_1 = 3[CD_{a_1}] + g_1 + g_2 - g_3; \quad 3R_2 = 3[CD_{a_2}] + g_1 - g_2 + g_3$$

$$3R_0' = 3[BD_{a_0}] + g_1' - g_2' + g_3'; \quad 3R_1' = 3[BD_{a_1}] - g_1' + g_2' + g_3'; \quad 3R_2' = 3[BD_{a_2}] + g_1' + g_2' - g_3'$$

$$3R_0'' = 3[BC_{a_0}] + g_1'' + g_2'' - g_3''; \quad 3R_1'' = 3[BC_{a_1}] + g_1'' - g_2'' + g_3''; \quad 3R_2'' = 3[BC_{a_2}] - g_1'' + g_2'' + g_3''$$

The sums of squares for the two-factor interactions are as follows. 

$$CD = \frac{(3Q)^2}{576}; \qquad BD = \frac{(3Q')^2}{576}; \qquad BC = \frac{(3Q'')^2}{576}$$

For ACD, divide the sum of squares of deviations of the quantities $3R_0$, 
$3R_1$, $3R_2$ by 120, with similar rules for ABD and ABC. 

The CD, BD, and BC two-way tables must be adjusted for block effects. 
For the means per unit in the BC table, the adjustment is 

$$\frac{3Q''}{192} - \frac{[BC]}{72}$$

The adjustment is added (algebraically) when b and c are both present 
or both absent; when b or c alone is present the adjustment is subtracted. 

$4^2$ factorial, blocks of 4 units. In plan 6.12 each replication confounds 
completely a set of three of the 9 d.f. for AB. Since the relative 
information on AB is 2/3, this plan gives less precise estimates of the 
interactions than randomized blocks unless the reduction in block size 
brings a substantial decrease in the error m.s. The balanced design, which 
requires a multiple of 3 replicates, is preferable. 

With 3 replicates the analysis of variance is as follows. 

                               d.f. 
Replicates                       2 
Blocks within replicates         9 
Main effects                     6 
Two-factor interactions          9 
Error                           21 
Total                           47 

To compute the sum of squares for AB, let $B_{mn}$ be the total of the nth 
block in the mth replicate, and let $T_{mn}$ be the total of all treatments that 
appear in this block (this will be a total over 12 units). Let 
$P_{mn} = T_{mn} - B_{mn}$, so that there are 12 quantities $P_{mn}$. Then 

$$AB = \tfrac{1}{8}\left(P_{11}^2 + P_{12}^2 + \cdots + P_{34}^2\right) - \tfrac{1}{32}\left(G^2 + R_{I}^2 + R_{II}^2 + R_{III}^2\right)$$

where G is the grand total and the R's are replication totals. 

The estimated block mean (adjusted for treatment effects) is 

$$b_{mn} = \frac{2B_{mn} - P_{mn}}{8}$$

To present a two-way table, the adjusted mean of treatment (00) is, for 
example, 

$$(00)' = (00) - \tfrac{1}{3}\left(b_{Ib} + b_{IIa} + b_{IIIb}\right)$$

since this treatment occurs in blocks Ib, IIa, and IIIb. 

$4 \times 2^2$ factorial, blocks of 8 units. In plan 6.13, one of the three 
components of ABC is confounded in each replicate. In other words, the 
difference between the treatments in block Ia and those in block Ib is one 
component of ABC. The plan gives 2/3 relative information on ABC. 
Any number of replicates may be used, though three are needed for 
balanced confounding. 

To obtain the sum of squares for ABC, calculate the total for each 
component separately, taking it only from those replicates in which it is 
not confounded. The divisor for the square of this total is obtained by 
the usual rules. Thus with 2 replicates the first component is taken from 
replicate II and has divisor 16, the second component is taken from 
replicate I and has divisor 16, while the third component comes from both 
replicates and has divisor 32. 

If there are r replicates, the adjusted block mean is 

$$b_{mn} = \frac{rB_{mn} - T_{mn}}{8(r - 1)}$$

where $B_{mn}$ is the block total and $T_{mn}$ that of all treatments appearing in 
the block. With 2 replicates, the adjusted mean of treatment 111 is 

$$(111)' = (111) - \frac{b_{Ia} + b_{IIb}}{2}$$

since this treatment is found in blocks Ia and IIb. 

$4 \times 3 \times 2$ factorial, blocks of 12 units. The 3 d.f. for the factor A at 
4 levels may be divided into the following components. 

$$A' = a_3 + a_2 - a_1 - a_0$$

$$A'' = a_3 - a_2 - a_1 + a_0$$

$$A''' = a_3 - a_2 + a_1 - a_0$$

In the first 3 replicates of plan 6.14, the interactions A'C and A'BC 
are partially confounded, with 8/9 and 5/9 relative information, 
respectively, the interactions of A'' and A''' being unconfounded; A''C and 
A''BC are partially confounded in replicates 4 to 6 and A'''C and A'''BC 
in replicates 7 to 9. Although nine replicates are required for complete 
balance, the design may be used with three or six replicates. In these 
cases the last three or the last six replications should be taken from plan 
6.14. The AC interaction suffers only a trivial loss of replication from 
the confounding. The statistical analysis is described in reference (6.6). 


6.34 Missing Values. When factorial experiments are arranged in 
randomized blocks or latin squares with no confounding, the formulae 
previously given (sections 4.25, 4.36) are used for the estimation of miss- 
ing values. With confounding, the formulae are changed, becoming in 
general more complicated. The different cases are discussed below. 

i. Complete confounding of the # 4 , 2 5 , factorials. When 

the effects that are confounded are unimportant, we recommended that 
the first replication of the plan be used for all replications of the 
experiment, with complete confounding of these effects. In this case the 
estimation formula is simple. Let B be the total yield of all other units in 
the same block as the missing unit, and T be the total yield of all other 
units which have the same treatment as the missing unit. Every 
replication contains a block which has the same set of treatments as the 
block with the missing value. Let B' be the total yield of all such blocks 
(including the block with the missing value). 

The value for the missing unit is estimated by the formula: 

$$y = \frac{rB + kT - B'}{(r - 1)(k - 1)}$$

where k is the number of units per block, and r is the number of 
replications. 

It will be realized that this is the standard formula for randomized 
blocks (section 4.25), applied only to those blocks which contain the 
treatment with the missing value. 
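
As a minimal sketch of case i, the formula can be written as a small Python function. The function name and the numerical yields are hypothetical; the denominator (r - 1)(k - 1) is taken from the randomized-blocks formula to which the text refers.

```python
def missing_value_complete(B, T, B_prime, r, k):
    """Estimate of a missing yield under complete confounding.

    B       : total of the other units in the block with the missing unit
    T       : total of the other units receiving the same treatment
    B_prime : total of all blocks holding the same set of treatments
              (including the block with the missing value)
    r, k    : number of replications and units per block
    """
    if r < 2 or k < 2:
        raise ValueError("need at least 2 replications and 2 units per block")
    return (r * B + k * T - B_prime) / ((r - 1) * (k - 1))

# Sanity check on hypothetical data in which every observed yield is 10:
# the estimate should then also be 10.
r, k, c = 3, 4, 10.0
estimate = missing_value_complete(B=(k - 1) * c, T=(r - 1) * c,
                                  B_prime=(r * k - 1) * c, r=r, k=k)
print(estimate)
```

The constant-yield check is a convenient way to confirm that the totals B, T, and B' have been formed over the right numbers of units.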

ii. Partial confounding of the # 3 , # 4 , 0 6 , # 3 , S 4 , //, 4 X factorials. 

The formula given later is valid when a different replication of the plan 
is used for each replication of the experiment. A further essential con- 
dition is that no treatment comparison be confounded in more than one 
replication. This condition is satisfied in all plans for the designs above, 
except possibly plan 6.4 ($2^4$ in blocks of 4) and plan 6.6 ($2^6$ in blocks of 
8). For instance, if the first 2 replications of plan 6.6 are used, no 
interaction is confounded more than once; but if the first 3 replications are 
used, BCDE is confounded twice and the formula does not apply. 

The formula requires some preliminary calculations, most of which 
can be used in the subsequent analysis of variance. In addition to B and 
T, as defined in case i, we require first the total R of all other units in the 
same replicate as the missing unit and the grand total G of all other units 
in the experiment. In the other replicates, calculate the total Sb of all 
blocks which contain the treatment with the missing unit. 

Next form the table of treatment totals. From this table compute 
the sum U of the observations for all other treatments which appear in 
the block which has the missing unit. In the other replications, calculate 
the corresponding sum for each block which contains the treatment with 
the missing unit. Let the total of these (r — 1) sums be V, where r is 
the number of replicates. Note that in the calculation of U and V the 
treatment with the missing value is not included. 

The missing yield is estimated by the equation 

$$y = \frac{kt(r - 1)T + tr(r - 1)B + kG + tV - krR - t(r - 1)U - trS_b}{(r - 1)\left[t(r - 1)(k - 1) - (t - k)\right]}$$

The table on p. 204 which shows the number of units whose observations 
are added to obtain the various totals T, B, ..., may be helpful in 
checking that the totals have been correctly identified. 

Consider U, for example. There are (k - 1) other treatments in the 
block with the missing value, and each treatment total is the sum of r 
observations. Consequently U is a total of r(k - 1) individual 
observations. 


Total             Number of units 

T                 (r - 1) 
B                 (k - 1) 
G                 (tr - 1) 
V                 r(r - 1)(k - 1) 
R                 (t - 1) 
U                 r(k - 1) 
S_b               k(r - 1) 
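
The estimate for case ii can be sketched in Python as follows. The function name is our own, and t is assumed here to be the number of treatment combinations (units per replicate), which is what the count (t - 1) for R in the table above implies. The check uses hypothetical constant yields with each total formed over the number of units listed in the table.

```python
def missing_value_partial(T, B, G, V, R, U, S_b, r, k, t):
    """Missing-yield estimate under partial confounding (case ii).

    r : replicates, k : units per block,
    t : treatment combinations per replicate (assumed meaning of t).
    """
    numerator = (k * t * (r - 1) * T + t * r * (r - 1) * B + k * G + t * V
                 - k * r * R - t * (r - 1) * U - t * r * S_b)
    denominator = (r - 1) * (t * (r - 1) * (k - 1) - (t - k))
    return numerator / denominator

# Hypothetical check: if every observed yield equals c, each total is c
# times its number of units, and the estimate should again equal c.
r, k, t, c = 2, 4, 8, 5.0
estimate_partial = missing_value_partial(
    T=(r - 1) * c,
    B=(k - 1) * c,
    G=(t * r - 1) * c,
    V=r * (r - 1) * (k - 1) * c,
    R=(t - 1) * c,
    U=r * (k - 1) * c,
    S_b=k * (r - 1) * c,
    r=r, k=k, t=t,
)
print(estimate_partial)
```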


iii. Partial confounding of the $3 \times 2^2$, $3 \times 2^3$, $3^2 \times 2$, $4 \times 3 \times 2$ 
factorials. For these designs the general method given by Yates (6.16) 
should be followed. 

iv. Single replication of the 3*, $ 6 , & 6 , 3* factorials. A missing value is 
estimated by minimizing the sum of squares for the interactions that are 
used as error. 

The estimation of missing values in cases iii and iv requires a knowl- 
edge of the algebra of the analysis of variance. The expressions for the 
missing value are usually fairly simple in case iv but are complicated in 
case iii. 

When the estimated value has been substituted, the analysis of variance 
is calculated by the appropriate method for the design. As usual, 
the error d.f. are reduced by one for each estimated value. Variance-ratio 
tests of the treatment effects are slightly disturbed. For t-tests, an 
approximate rule is to decrease the number of replicates ascribed to any 
mean by 1½ for each estimated value that the mean contains. Suppose, 
for instance, that (abc) is missing in a $2^3$ experiment with 3 replications. 
The mean of the units which receive a is taken over 12 observations, one 
of which is an estimated value. By the rule, the standard error of this 
mean is computed as $s/\sqrt{10\tfrac{1}{2}}$ instead of $s/\sqrt{12}$. Consequently the 
standard error of the mean response to A is taken as 

$$s\sqrt{\frac{1}{10\tfrac{1}{2}} + \frac{1}{12}}$$

instead of 

$$s\sqrt{\frac{1}{12} + \frac{1}{12}}$$


6.35 Estimation of the Gain in Precision from Confounding. As 
mentioned previously, the advisability of confounding depends on the 
amount by which the experimental error is decreased when the block size 
is reduced. It is therefore worth while, especially where confounding has 
not been previously used, to estimate the gain in precision which has been 
obtained in each experiment. 

The technique is essentially the same for all types of design. From the 
results of the analysis of variance of a confounded experiment, an 
estimate $E_r$ is made of the experimental error which would have been 
present if the experiment had been laid out in randomized complete 
blocks. 


Single replication. Let $E_b$ be the mean square for blocks, and $E_e$ the 
mean square for error. If there is only 1 replication in the experiment, 
the estimate $E_r$ is 

$$E_r = \frac{n_b E_b + n_e E_e}{n_b + n_e} \qquad (6.1)$$

where $n_b$ = number of degrees of freedom for blocks, 

$n_e$ = total number of degrees of freedom minus $n_b$. 

For example, with a $2^6$ experiment in blocks of 8 units (single replication) 

$$n_b = 7, \qquad n_e = 63 - 7 = 56$$

The estimate $E_r$ is directly comparable with the estimated error $E_e$. As 
shown in section 2.31, the comparison may be expressed in terms of the 
relative amounts of replication required to obtain equal accuracy with 
the two types of layout. 

The estimate $E_r$ is subject to the assumption that the interactions 
which are confounded with blocks are negligible. Since confounding is 
complete, no method of estimation avoids this assumption. If the 
confounded interactions are not negligible, the calculation overestimates the 
gain from confounding. 


More than one replication. If the confounding is complete, formula 
(6.1) holds for $E_r$, subject again to the assumption that the confounded 
comparisons are negligible. In this case 

$E_b$ = mean square for blocks within replications, 

$E_e$ = error mean square, 

$n_b$ = number of degrees of freedom for blocks within replications, 

$n_e$ = total number of degrees of freedom minus number of degrees of 
freedom for replications minus $n_b$. 

Thus with 2 replicates of a $2^6$ design in blocks of 8 units, the same 
interactions being confounded in both replicates, we have 

$$n_b = 14, \qquad n_e = 127 - 1 - 14 = 112$$

With partial confounding, the same formula may be used if all 
confounded comparisons are assumed to be small. Evidence on the validity 
of this assumption is provided by the experimental results, since partially 
confounded comparisons can be tested for significance. 

It is possible to avoid the assumption that confounded comparisons are 
negligible, though some extra computations are necessary. We first 
adjust the mean square among blocks within replicates so as to remove 
treatment effects. The following identity holds for the sums of squares. 

Blocks within replicates (adj.) + treatments (unadj.) = 
blocks within replicates (unadj.) + treatments (adj.) 

The two quantities on the right of the equation appear in the analysis 
of variance of the experimental results. Consequently the adjusted 
blocks s.s. is found from this equation by calculating the unadjusted 
treatments s.s. From the adjusted blocks s.s. we obtain, as shown 
below, a new estimate $E_b'$ for use in formula (6.1). 

These procedures may be illustrated from the results of the 3 × 3 × 2 
experiment in 4 replicates (section 6.19), in which AB and ABC are 
partially confounded. From table 6.15 the following summary of the 
analysis of variance is obtained. 



                              d.f.      s.s.      m.s. 
Replications                    3      3,837     1,279 
Blocks within replications      8      2,836       354 
Treatments                     17      7,802       459 
Error                          43      8,909       207 
Total                          71     23,384 



If AB and ABC are assumed negligible, the estimate $E_r$ is found by 
formula (6.1) for the case where there is more than one replication. 

$$E_r = \frac{8 \times 354 + 60 \times 207}{68} = 224$$

In order to avoid the assumption, we calculate the unadjusted 
treatments s.s., which is found to be 8328. Since the adjusted treatments 
s.s. is 7802, we have 

Blocks s.s. (adj.) = 2836 + 7802 - 8328 = 2310 

corresponding to a mean square $E_{ba}$ of 289. 

An unbiased estimate $E_b'$ of the variance between blocks within 
replicates is 

$$E_b' = \frac{4E_{ba} - E_e}{3} = \frac{4 \times 289 - 207}{3} = 316$$

This value is inserted in formula (6.1) to give $E_r'$: 

$$E_r' = \frac{8 \times 316 + 60 \times 207}{68} = 220$$

The estimated gain in efficiency over randomized blocks is 8% when 
AB and ABC are assumed negligible and 6% without this assumption. 
The two results agree closely, as is usually the case unless the confounded 
interactions are large. 
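
The arithmetic of this example can be retraced with a short Python sketch. The figures are those quoted from the summary of table 6.15; the function and variable names are our own, and the gain is computed here simply as the ratio of the estimated randomized-blocks error to the actual error mean square, less one.

```python
def randomized_block_error(E_b, E_e, n_b, n_e):
    """Formula (6.1): estimated error for randomized complete blocks."""
    return (n_b * E_b + n_e * E_e) / (n_b + n_e)

# Values quoted in the text for the 3 x 3 x 2 experiment in 4 replicates.
r = 4
E_e = 207.0                      # error mean square
n_b = 8                          # d.f. for blocks within replications
n_e = 71 - 3 - n_b               # total d.f. - replication d.f. - n_b = 60

# (a) Confounded interactions AB and ABC assumed negligible.
E_r = randomized_block_error(354.0, E_e, n_b, n_e)

# (b) Without that assumption: adjust the blocks s.s. for treatments.
ss_blocks_adj = 2836 + 7802 - 8328           # identity for the sums of squares
E_ba = ss_blocks_adj / n_b                   # adjusted blocks mean square
E_b_adj = (r * E_ba - E_e) / (r - 1)         # variance between blocks
E_r_adj = randomized_block_error(round(E_b_adj), E_e, n_b, n_e)

gain = E_r / E_e - 1.0
gain_adj = E_r_adj / E_e - 1.0
print(round(E_r), round(E_b_adj), round(E_r_adj))
print(f"{gain:.0%} {gain_adj:.0%}")
```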

The method above applies to all designs with partial confounding. 
For the designs discussed in this chapter, the formula for $E_b'$ in terms of 
$E_{ba}$ and $E_e$ is 

$$E_b' = \frac{rE_{ba} - E_e}{r - 1}$$

where r is the number of replicates. 


Plan 6.1 $2^3$ factorial, blocks of 4 units 

Rep. I, ABC confounded 

Plan 6.3 $2^6$ factorial, blocks of 16 units 

Rep. I, ABCD, ABEF, CDEF confounded 

Plan 6.4 Balanced group of sets for $2^4$ factorial, blocks of 4 units 

Two-factor interactions are confounded in 1 replication and three-factor 
interactions are confounded in 3 replications. The columns are the blocks. 

Rep. I, AB, ACD, BCD confounded 
Rep. II, AC, ABD, BCD 

Plan 6.5 Balanced group of sets for $2^5$ factorial, blocks of 8 units 

Three- and four-factor interactions are confounded in 1 replication. 

Rep. I, ABC, ADE, BCDE confounded 
Rep. II, ABD, BCE, ACDE 
Rep. III, ACE, BCD, ABDE 
Rep. IV, ACD, BDE, ABCE 

Plan 6.6 Balanced group of sets for $2^6$ factorial, blocks of 8 units 

All three- and four-factor interactions are confounded in 2 replications. 

Rep. I, ABC, CDE, ADF, BEF, ABDE, BCDF, ACEF confounded 
Rep. VI, ABC, BDE, ADF, CEF, ACDE, BCDF, ABEF 
Interchange B and C in replication I 
Rep. VII, ABF, DEF, BCD, ACE, ABDE, ACDF, BCEF 
Interchange F and D in replication II 
Rep. VIII, ABE, BDF, CDE, ACF, ADEF, ABCD, BCEF 
Interchange A and E in replication III 
Interchange F and D in replication IV 
Rep. X, AEF, BDE, ACD, BCF, ABDF, CDEF, ABCE 

Plan 6.7 Balanced group of sets for $3^3$ factorial, blocks of 9 units 

ABC confounded 

Plan 6.8 Balanced group of sets for $3^4$ factorial, blocks of 9 units 

Three-factor interactions confounded 

Rep. I, ABC IV, ABD I, ACD II, BCD III confounded 
Rep. II, ABC III, ACD IV, ABD II, BCD I 
Rep. III, ABC II, ACD III, ABD IV, BCD IV 
Rep. IV, ABC I, ACD I, ABD III, BCD II 


0000  0012  0021  0011  0020  0002  0022  0001  0010 
1012  1021  1000  1020  1002  1011  1001  1010  1022 
2021  2000  2012  2002  2011  2020  2010  2022  2001 
0111  0120  0102  0122  0101  0110  0100  0112  0121 
1120  1102  1111  1101  1110  1122  1112  1121  1100 
2102  2111  2120  2110  2122  2101  2121  2100  2112 
0222  0201  0210  0200  0212  0221  0211  0220  0202 
1201  1210  1222  1212  1221  1200  1220  1202  1211 
2210  2222  2201  2221  2200  2212  2202  2211  2220 



Plan 6.9 Balanced group of sets for $3 \times 2^2$ factorial, blocks of 6 units 

Plan 6.10 

Plan 6.11 Balanced group of sets for $3^2 \times 2$ factorial, blocks of 6 units 

AB, ABC confounded 

Plan 6.12 Balanced group of sets for $4^2$ factorial, blocks of 4 units 

Plan 6.13 Balanced group of sets for $4 \times 2^2$ factorial, blocks of 8 units 

Plan 6.14 Balanced group of sets for $4 \times 3 \times 2$ factorial, blocks of 12 units 

A'C, A'BC confounded 


   Rep. I        Rep. II       Rep. III 
   a     b       a     b       a     b 

  000   001     001   000     001   000 
  011   010     010   011     011   010 
  021   020     021   020     020   021 
  100   101     101   100     101   100 
  111   110     110   111     111   110 
  121   120     121   120     120   121 
  201   200     200   201     200   201 
  210   211     211   210     210   211 
  220   221     220   221     221   220 
  301   300     300   301     300   301 
  310   311     311   310     310   311 
  320   321     320   321     321   320 





Plan 6.14 (Continued) 

Balanced group of sets for $4 \times 3 \times 2$ factorial, blocks of 12 units 

CHAPTER 7 


FACTORIAL EXPERIMENTS WITH MAIN EFFECTS 
CONFOUNDED: SPLIT-PLOT DESIGNS 


7.1 The Simple Split-plot Design 

7.11 Description. In field experiments an extra factor is sometimes 
introduced into an experiment by dividing each plot into a number of 
parts. For example, if the experiment is planned originally to test a 
factor A with five levels, the division of each plot into halves permits the 
inclusion of an extra factor B at two levels. Within each plot the two 
levels of B are allotted at random to the two sub-plots. If the whole 
plots are in a randomized block design, the plan (after randomization) 
might appear as shown in table 7.1. It is worth noting the difference 
between this arrangement and ordinary randomized blocks. In the latter 
the ten treatment combinations are assigned to the ten sub-plots in a 
replication completely at random. Here we have a more orderly 
assignment in which the two treatment combinations that have any given level 
of A always appear in the same whole-plot. 

TABLE 7.1 Example of a split-plot design 

This type of arrangement is common in industrial experimentation, 
although the connection may not at first be realized. Frequently, one 
series of treatments requires rather a large bulk of experimental 
material, while another series can be compared with much smaller amounts. 
For instance, the comparison of different types of furnace for the 
preparation of an alloy would use much greater amounts of alloy than the 
comparison of different types of mould into which the alloy might be 
poured. In an experiment in which both factors are to be tested, the 
natural procedure is to take the material prepared in any furnace, and 
pour some of it into each mould. That is, material prepared in one 
furnace at one time provides a complete replication for the comparisons 
among moulds, just as the plot containing any level of A provides a 
complete replication of the factor B. Another instance is the comparison of 
different machines for milking dairy cows. Each machine would 
necessitate rather substantial amounts of milk, whereas other comparisons, 
for example on the best method of pasteurizing or of cooling, could be 
conducted with a much smaller amount of milk per treatment. The 
produce from any machine could be subdivided for these subsequent 
tests. 

We may describe this design in another way that brings out more 
clearly its relation to the confounded factorial designs discussed in 
chapter 6. If the sub-units are regarded as the experimental units, it is seen 
that the treatments $a_0, a_1, \ldots, a_4$ are applied to groups or blocks of two 
units. Differences among these blocks are confounded with differences 
among the levels of A; i.e., the main effects of A are confounded. 
Accordingly, the split-plot design is sometimes considered as one in which 
certain main effects are confounded, as contrasted with the designs in 
chapter 6, where the confounding is restricted to interactions. 

7.12 Nature of the Experimental Error. In the statistical analysis, 
account must be taken of the fact that the observations from different 
sub-units in the same unit may be correlated. In field experiments this 
correlation is just a reflection of the fact that neighboring pieces of land 
tend to be similar in fertility and in other agronomic properties. In in- 
dustrial experiments the same correlation may be present, because any 
factor that affects the whole batch of alloy prepared in one furnace at 
one time will tend to create similarity among smaller batches poured 
from it. 

The mathematical analysis used to examine the effects of this 
correlation will be illustrated for the experiment in table 7.1. Let i refer to the 
replication, j to the level of A, and k to that of B: for sub-units in the 
same unit, the i and j subscripts will be the same. The assumption is 
made that a correlation $\rho$ exists between the experimental errors $e_{ijk}$ and 
$e_{iju}$ for any two sub-units in the same unit. Sub-units in different units 
are assumed to be uncorrelated. Mathematically, we have 

$$E(e_{ijk}e_{iju}) = \rho\sigma^2; \qquad E(e_{ijk}e_{stu}) = 0$$

We now consider the effects of this model on the most important 
treatment comparisons. The main effects of A are calculated entirely from 
unit totals or means. With two sub-units per unit, the error variance of 
a unit total is 

$$E(e_{ij1} + e_{ij2})^2 = E(e_{ij1}^2) + E(e_{ij2}^2) + 2E(e_{ij1}e_{ij2}) = \sigma^2 + \sigma^2 + 2\rho\sigma^2 = 2\sigma^2(1 + \rho)$$

The factor 2 may be regarded as representing the effect of adding over 
2 sub-units. Consequently, for the main effects of A, the appropriate 
error variance per sub-unit is $\sigma^2(1 + \rho)$. If there are $\beta$ sub-units per 
unit, the corresponding variance works out at $\sigma^2[1 + (\beta - 1)\rho]$. 

The main effects of B, on the other hand, are derived from the 
difference between the two sub-units in a unit. For this we have 

$$E(e_{ij1} - e_{ij2})^2 = 2\sigma^2(1 - \rho)$$

Thus the effective variance per sub-unit applicable to the main effects 
of B is $\sigma^2(1 - \rho)$. This expression remains unchanged when there are 
$\beta$ sub-units per unit. This variance also applies to any component of the 
AB interaction, since such components involve comparisons of $(b_1 - b_0)$ 
at different levels of A. 

For other treatment comparisons, the basic error variance may be
different from either of the two expressions above. Consider, for
instance, $(a_3b_0 - a_1b_0)$, a comparison of $a_3$ with $a_1$ at the zero level
of B. In any replication the 2 sub-units involved are in different units,
and are therefore independent. The variance of their difference is
therefore $2\sigma^2$, and the basic variance per sub-unit is $\sigma^2$. However, the
appropriate variance for all other comparisons of this type can be derived
from the basic variances $\sigma^2[1 + (\beta - 1)\rho]$ and $\sigma^2(1 - \rho)$. As will be
seen, the analysis of variance gives unbiased estimates of these two
variances, and from these, unbiased estimates of any particular variance
can be obtained.
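The variances above can be checked numerically. The following is a minimal sketch, assuming equal variance and a common correlation for all sub-units within a unit; the function names and the illustrative values of the variance and correlation are hypothetical and are not taken from the text.

```python
def split_plot_variances(sigma2, rho, beta):
    """Effective error variances per sub-unit.

    Returns (variance for A main effects, variance for B and AB effects),
    i.e. sigma2*[1 + (beta - 1)*rho] and sigma2*(1 - rho).
    """
    whole = sigma2 * (1 + (beta - 1) * rho)
    sub = sigma2 * (1 - rho)
    return whole, sub


def variance_of_combination(coeffs, sigma2, rho):
    """Variance of sum(c_k * e_k) for errors e_k of sub-units in one unit.

    Each error has variance sigma2; any two errors in the same unit have
    covariance rho * sigma2.
    """
    total = 0.0
    for a, ca in enumerate(coeffs):
        for b, cb in enumerate(coeffs):
            cov = sigma2 if a == b else rho * sigma2
            total += ca * cb * cov
    return total


# Illustrative (assumed) values: sigma^2 = 1, rho = 0.3, two sub-units per unit.
whole, sub = split_plot_variances(sigma2=1.0, rho=0.3, beta=2)
unit_total_var = variance_of_combination([1, 1], 1.0, 0.3)   # 2*sigma2*(1 + rho)
unit_diff_var = variance_of_combination([1, -1], 1.0, 0.3)   # 2*sigma2*(1 - rho)
```

With a positive correlation the unit total is more variable, and the within-unit difference less variable, than they would be for independent sub-units.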

In practice, $\rho$ is nearly always positive. The result is that the main
effects of A (the factor applied to the units) are less precisely estimated 
than those of B or than the AB interaction. 

The analysis of variance is fairly easy. For the example in table 7.1 we
first compute the 15 plot totals. Their sum of squares of deviations is
partitioned in the usual way into 2 d.f. for replications, 4 for the main
effects of A, and 8 for the experimental error applicable to a whole-plot.
The mean square for the latter is an unbiased estimate of $\sigma^2(1 + \rho)$.
All computations are divided by 2 to convert them to a sub-unit basis.

Next take the difference $(b_1 - b_0)$ on every unit. The 15 differences
provide 1 d.f. which represents the main effect of B, 4 d.f. which represent
the AB interactions, and the remaining 10 d.f. whose mean square gives
an unbiased estimate of the sub-plot error variance $\sigma^2(1 - \rho)$. All sums
of squares are again divided by 2. The complete separation of degrees
of freedom is shown in table 7.2.

TABLE 7.2 Analysis of variance for the split-plot experiment in table 7.1

                              d.f.
Whole plots
  Replications                  2
  A                             4
  Whole-plot error              8
    Total                      14

Sub-plots
  B                             1
  AB                            4
  Sub-plot error               10

    Grand total                29


The point to be noted is that the whole-plot error is computed en- 
tirely from plot totals, and the sub-plot error entirely from the differences 
between sub-plots in the same plot. It will be seen that the grand total 
of the degrees of freedom is 29. As might be expected, the corresponding
sum of squares is the sum of squares of deviations of the 30 observations 
from their mean. A computation of this quantity provides a check on all 
sums of squares. In practice, particularly with more than 2 sub-unit 
treatments, we usually find the sub-plot error by subtraction. 

7.13 Comparison with Randomized Blocks. The experiment in table
7.1 might be arranged in ordinary randomized blocks, with 3 blocks of 
10 treatments each. In a consideration of the relative merits of the two 
arrangements, the following points are relevant. 

1. With the split-plot design, usually the B and the AB effects are 
estimated more precisely than the A effects. Moreover, the number of 
degrees of freedom available for the experimental error m.s. is smaller
for whole-unit comparisons than for sub-unit comparisons. 

2. It can be shown that the average experimental error over all treat- 
ment comparisons is the same for both designs. Consequently there is no 
net gain in precision resulting from the use of the split-plot design; the 
increased precision on B and AB is obtained by the sacrifice of precision 
on A. For tests of significance or the construction of confidence limits 
the randomized block design holds a slight advantage on the average 
since it provides more degrees of freedom for the estimate of the single 
error variance that it requires. For instance, the experiment cited has
8 d.f. for the whole-unit error and 10 d.f. for the sub-unit error, whereas
randomized blocks would provide 18 d.f.

3. As we have indicated, the chief practical advantage of the split- 
plot arrangement is that it enables factors that require relatively large 
amounts of material and factors that require only small amounts to be 
combined in the same experiment. If the experiment is planned to in- 
vestigate the first type of factor, so that large amounts of material are 
going to be used anyway, factors of the second type can often be
included at very little extra cost, and some additional information obtained
very cheaply.

To summarize, the split-plot design is advantageous if the B and AB 
effects are of greater interest than the A effects, or if the A effects cannot 
be tested on small amounts of material. 

Two disadvantages have been mentioned by experimenters. Some- 
times the whole-unit error is much larger than the sub-unit error. It 
may happen that the effects of A, though large and exciting, are not
significant, whereas those of B, which are too small to be of practical
interest, are statistically significant. The experimenter tends to be 
uncomfortable in reporting results of this type. Secondly, the fact that 
different treatment comparisons have different basic error variances 
makes the analysis more complex than with randomized blocks, espe- 
cially if some unusual type of comparison is being made. 

When the number of replications and the experimental conditions are 
suitable, the whole units may be arranged in a latin square. A split-plot 
latin square, which eliminates the error variation arising from two types 
of grouping, may be preferable to randomized blocks. Summarizing 22 
field experiments in latin squares where the plots were split into halves, 
Yates (7.1) found a substantial net increase in precision over ran- 
domized blocks. The superiority of the latin square was so pronounced 
that even the whole-plot comparisons would have been less precisely de- 
termined in randomized block designs. Factorial combinations that lend 
themselves to the use of split-plot latin squares are the 5 × 2, 5 × 3,
5 × 4, 6 × 2, 6 × 3, 7 × 2, 7 × 3, 8 × 2.


7.14 Randomization. The treatments applied to the units are ran- 
domized according to the instructions for the design (e.g. randomized 
blocks, latin square, etc.) in which the units are arranged. The treat- 
ments applied to the sub-units are allotted at random within each unit. 
A separate randomization is carried out for each unit. 
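The two-stage randomization just described can be sketched as follows for units arranged in randomized blocks. This is only an illustration: the function name, the treatment labels, and the use of a seeded pseudo-random generator are assumptions made for the example, not part of the original procedure.

```python
import random


def randomize_split_plot(a_levels, b_levels, n_blocks, seed=None):
    """Return a field plan as a list of blocks.

    Each block is a list of (A level, [B levels in sub-unit order]) pairs.
    Stage 1: the A treatments are randomized to the units within each block.
    Stage 2: the B treatments are randomized afresh within every unit.
    """
    rng = random.Random(seed)
    plan = []
    for _ in range(n_blocks):
        units = list(a_levels)
        rng.shuffle(units)              # whole-unit randomization for this block
        block_plan = []
        for a in units:
            subs = list(b_levels)
            rng.shuffle(subs)           # separate randomization for each unit
            block_plan.append((a, subs))
        plan.append(block_plan)
    return plan


# A 5 x 2 experiment in 3 replications (labels are hypothetical).
plan = randomize_split_plot(["a1", "a2", "a3", "a4", "a5"], ["b0", "b1"],
                            n_blocks=3, seed=1)
for number, block in enumerate(plan, start=1):
    print("Rep.", number, block)
```

For units arranged in a latin square, stage 1 would be replaced by the randomization appropriate to that design; stage 2 is unchanged.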


7.15 Statistical Analysis. If the factor A (applied to the units)
contains $\alpha$ levels, and the factor B (applied to the sub-units) contains
$\beta$ levels, the subdivision of degrees of freedom in the analysis of
variance is shown in table 7.3. The details of the analysis are illustrated
in section 7.17.


TABLE 7.3 Partition of degrees of freedom for a split-plot design

Units arranged in randomized blocks        Units arranged in a latin square
(r replicates)                             (r = $\alpha$ replicates)

Units         d.f.                         Units         d.f.
  Blocks      $(r - 1)$                      Rows        $(\alpha - 1)$
  A           $(\alpha - 1)$                      Columns     $(\alpha - 1)$
  Error (a)   $(\alpha - 1)(r - 1)$               A           $(\alpha - 1)$
                                             Error (a)   $(\alpha - 1)(\alpha - 2)$
  Total       $(r\alpha - 1)$                     Total       $(\alpha^2 - 1)$

Sub-units                                  Sub-units
  B           $(\beta - 1)$                      B           $(\beta - 1)$
  AB          $(\alpha - 1)(\beta - 1)$               AB          $(\alpha - 1)(\beta - 1)$
  Error (b)   $\alpha(r - 1)(\beta - 1)$              Error (b)   $\alpha(\alpha - 1)(\beta - 1)$
  Total       $r\alpha(\beta - 1)$                   Total       $\alpha^2(\beta - 1)$
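As a check on the partition of degrees of freedom, a small helper can evaluate the entries for given numbers of levels and replicates. This is a sketch only; the function name and dictionary layout are illustrative assumptions, and the latin-square error (b) entry is obtained by putting r equal to the number of levels of A in the randomized-blocks expression.

```python
def split_plot_df(alpha, beta, r, latin_square=False):
    """Degrees of freedom for a split-plot design.

    alpha: levels of A (units); beta: levels of B (sub-units);
    r: replicates (must equal alpha when the units form a latin square).
    Returns (unit_part, sub_unit_part) as dictionaries.
    """
    if latin_square:
        if r != alpha:
            raise ValueError("a latin square needs r equal to alpha")
        units = {
            "Rows": alpha - 1,
            "Columns": alpha - 1,
            "A": alpha - 1,
            "Error (a)": (alpha - 1) * (alpha - 2),
        }
    else:
        units = {
            "Blocks": r - 1,
            "A": alpha - 1,
            "Error (a)": (alpha - 1) * (r - 1),
        }
    subs = {
        "B": beta - 1,
        "AB": (alpha - 1) * (beta - 1),
        "Error (b)": alpha * (r - 1) * (beta - 1),
    }
    return units, subs


# The 5 x 2 experiment in 3 replications discussed earlier in the chapter.
units, subs = split_plot_df(alpha=5, beta=2, r=3)
grand_total = sum(units.values()) + sum(subs.values())
```

For the 5 × 2 experiment in 3 replications this gives 8 d.f. for error (a), 10 d.f. for error (b), and a grand total of 29.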


7.16 Standard Errors. Let $E_a$ and $E_b$ be the error mean squares for
error (a) and error (b), respectively, on a sub-unit basis. For the
treatment means, also expressed on a sub-unit basis, the standard errors
shown in table 7.4 apply.

TABLE 7.4 Standard errors for the split-plot design

Treatment comparison                                       s.e.

Difference between two A means:
  e.g., $[(a_1) - (a_0)]$                                    $\sqrt{2E_a/r\beta}$
Difference between two B means:
  e.g., $[(b_1) - (b_0)]$                                    $\sqrt{2E_b/r\alpha}$
Difference between two B means at the same level of A:
  e.g., $[(a_1b_1) - (a_1b_0)]$                              $\sqrt{2E_b/r}$
Difference between two A means at the same level of B:
  e.g., $[(a_1b_1) - (a_0b_1)]$                              $\sqrt{2[(\beta - 1)E_b + E_a]/r\beta}$

The final comparison in table 7.4 contains both the main effect of A and
the AB interaction; consequently the appropriate error is a weighted mean
of $E_a$ and $E_b$. This error also applies to the difference between two A
means which have different levels of B. In such cases the ratio of the
treatment difference to its standard error does not follow the
t-distribution. For practical purposes the approximate rule of section 4.14
may be used, though this method gives slightly too few significant results.
Let $t_a$, $t_b$ be the significance levels of t corresponding to the degrees
of freedom in $E_a$ and $E_b$, respectively. The significance level of t is
taken as

$$t = \frac{(\beta - 1)E_b t_b + E_a t_a}{(\beta - 1)E_b + E_a}$$

For an application, see the next section.
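The weighted significance level of t can be written as a short function. The sketch below assumes the weights are $(\beta - 1)E_b$ and $E_a$, as in the worked computation of section 7.17; the function and argument names are illustrative, and the numbers used are those quoted in that section.

```python
def weighted_t(E_a, E_b, t_a, t_b, beta):
    """Approximate significance level of t for comparing two A means
    at the same (or different) levels of B.

    E_a, E_b: error mean squares (a) and (b) on a sub-unit basis.
    t_a, t_b: tabulated t values for the d.f. of E_a and E_b.
    beta:     number of levels of B.
    """
    w_b = (beta - 1) * E_b      # weight attached to t_b
    w_a = E_a                   # weight attached to t_a
    return (w_b * t_b + w_a * t_a) / (w_b + w_a)


# Values from the chocolate cake example: E_a = 42.8 (28 d.f.),
# E_b = 20.5 (210 d.f.), 5% levels of t equal to 2.05 and 1.97, beta = 6.
t_level = weighted_t(E_a=42.8, E_b=20.5, t_a=2.05, t_b=1.97, beta=6)
print(round(t_level, 2))
```

Because the result is a weighted mean, it necessarily falls between the two tabulated values.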


7.17 Numerical Example. In an experiment on the preparation of 
chocolate cakes, conducted at Iowa State College (7.2), 3 recipes for 
preparing the batter were compared. Recipes I and II differed in that 
the chocolate was added at 40°C. and 60°C., respectively, while recipe 
III contained extra sugar. In addition, 6 different baking temperatures 
were tested: these ranged in 10° steps from 175° to 225°. Each time that 
a mix was made by any recipe, enough batter was prepared for 6 cakes, 
each of which was baked at a different temperature. Thus the recipes 
are the “whole-unit” treatments, while the baking temperatures are the
“sub-unit” treatments. There were 15 replications, and it will be
assumed that these were conducted serially according to a randomized
blocks scheme: that is, one replication was completed before starting 
the next, so that differences among replicates represent time differences. 
(The notes suggest that this was done, though they are not quite ex- 
plicit.) 

A number of measurements were made on the cakes. The measure- 
ment presented here is the breaking angle. One half of a slab of cake is 
held fixed, while the other half is pivoted about the middle until breakage 
occurs. The angle through which the moving half has revolved is read 
on a circular scale. Since breakage is gradual, the reading tends to have 
a subjective element. The data are shown in table 7.5. 

It is customary to compute the analysis of variance on a sub-unit 
basis. To avoid confusion, this should be clearly stated in the analysis 
of variance table itself. The calculations may be presented in three 
steps. 

Step 1. Analyze the whole-unit totals by the method appropriate to 
the design in which they are arranged. 


Correction factor: $(8673)^2/270 = 278{,}596$

Total: $\dfrac{(269)^2 + (260)^2 + \cdots + (155)^2}{6} - 278{,}596 = 11{,}538$

Replications: $\dfrac{(843)^2 + (820)^2 + \cdots + (479)^2}{18} - 278{,}596 = 10{,}204$

Recipes: $\dfrac{(2981)^2 + (2848)^2 + (2844)^2}{90} - 278{,}596 = 135$

Error (a), by subtraction: 11,538 - 10,204 - 135 = 1,199


TABLE 7.5 Breaking angles (degrees)

                               Temperature
              Rep.   175°  185°  195°  205°  215°  225°    Unit     Rep.
                                                          totals   totals

Recipe I        1     42    46    47    39    53    42     269
                2     47    29    35    47    57    45     260
                3     32    32    37    43    45    45     234
                4     26    32    35    24    39    26     182
                5     28    30    31    37    41    47     214
                6     24    22    22    29    35    26     158
                7     26    23    25    27    33    35     169
                8     24    33    23    32    31    34     177
                9     24    27    28    33    34    23     169
               10     24    33    27    31    30    33     178
               11     33    39    33    28    33    30     196
               12     28    31    27    39    35    43     203
               13     29    28    31    29    37    33     187
               14     24    40    29    40    40    31     204
               15     26    28    32    25    37    33     181

  Totals              437   473   462   503   580   526    2981

Recipe II       1     39    46    51    49    55    42     282      843
                2     35    46    47    39    52    61     280      820
                3     34    30    42    35    42    35     218      665
                4     25    26    28    46    37    37     199      599
                5     31    30    29    35    40    36     201      583
                6     24    29    29    29    24    35     170      517
                7     22    25    26    26    29    36     164      492
                8     26    23    24    31    27    37     168      493
                9     27    26    32    28    32    33     178      492
               10     21    24    24    27    37    30     163      519
               11     20    27    33    31    28    33     172      538
               12     23    28    31    34    31    29     176      557
               13     32    35    30    27    35    30     189      574
               14     23    25    22    19    21    35     145      502
               15     21    21    28    26    27    20     143      479

  Totals              403   441   476   482   517   529    2848     8673

Recipe III      1     46    44    45    46    48    63     292
                2     43    43    43    46    47    58     280
                3     33    24    40    37    41    38     213
                4     38    41    38    30    36    35     218
                5     21    25    31    35    33    23     168
                6     24    33    30    30    37    35     189
                7     20    21    31    24    30    33     159
                8     24    23    21    24    21    35     148
                9     24    18    21    26    28    28     145
               10     26    28    27    27    35    35     178
               11     28    25    26    25    38    28     170
               12     24    30    28    35    33    28     178
               13     28    29    43    28    33    37     198
               14     19    22    27    25    25    35     153
               15     21    28    25    25    31    25     155

  Totals              419   434   476   463   516   536    2844

Temp. totals         1259  1348  1414  1448  1613  1591    8673





Step 2. This concerns the sub-unit treatments. Their main effects are
obtained directly.

Temperatures: $\dfrac{(1259)^2 + (1348)^2 + \cdots + (1591)^2}{45} - 278{,}596 = 2100$

The sum of squares for interactions between sub-unit and whole-unit
treatments is found by subtraction. First calculate the total s.s. for the
two-way table that shows both sets of treatments.

Total treatments: $\dfrac{(437)^2 + (473)^2 + \cdots + (516)^2 + (536)^2}{15} - 278{,}596 = 2441$

Then,

Recipes × temperatures: 2441 - 135 - 2100 = 206

Step 3. Compute the total s.s. among all sub-units.

Total: $(42)^2 + (47)^2 + \cdots + (35)^2 + (25)^2 - 278{,}596 = 18{,}143$

The sum of squares for error (b) is then found by subtraction in table
7.6.
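The sums of squares in steps 1 and 2 that depend only on marginal totals can be reproduced directly. The sketch below uses the replication, recipe, and temperature totals as read from table 7.5; the helper name is an illustrative assumption. Each sum of squares is the sum of squared totals divided by the number of observations per total, less the correction factor.

```python
# Marginal totals as read from table 7.5 (breaking angles, 270 cakes).
rep_totals = [843, 820, 665, 599, 583, 517, 492, 493,
              492, 519, 538, 557, 574, 502, 479]       # 18 cakes each
recipe_totals = [2981, 2848, 2844]                      # 90 cakes each
temp_totals = [1259, 1348, 1414, 1448, 1613, 1591]      # 45 cakes each
n_obs = 270


def ss_from_totals(totals, n_per_total, correction):
    """Sum of squares of deviations computed from a set of totals."""
    return sum(t * t for t in totals) / n_per_total - correction


grand = sum(temp_totals)
cf = grand ** 2 / n_obs                                  # correction factor

ss_reps = ss_from_totals(rep_totals, 18, cf)
ss_recipes = ss_from_totals(recipe_totals, 90, cf)
ss_temps = ss_from_totals(temp_totals, 45, cf)

print(round(cf), round(ss_reps), round(ss_recipes), round(ss_temps))
```

The total s.s. among units, among treatment combinations, and among all sub-units are found in the same way from the unit totals, the recipe-by-temperature totals, and the individual observations.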


TABLE 7.6 Analysis of variance of breaking angles (on a sub-unit
basis) for the experiment on chocolate cakes

Source of variation          d.f.      s.s.       m.s.        F

Replications                  14      10,204
Recipes                        2         135       67.5      1.58
Error (a)                     28       1,199       42.8

Temperatures                   5       2,100      420.0     20.49
  Linear regression            1       1,967    1,967.0     95.95
  Deviations                   4         133       33.2      1.62
Recipes × temperatures        10         206       20.6      1.00
Error (b)                    210       4,299       20.5

Total                        269      18,143




The mean square for recipes, while above that for error (a), does not 
approach the 5% level. Temperature effects are highly significant as 
compared with error (b). Since there is a fairly steady increase in the 
breaking angle with increased temperature, the sum of squares has been 
divided into the component due to a linear regression on temperature 
and that due to deviations from the straight line. The regression coeffi- 
cient amounts to an increase of 1.6° in the breaking angle for each 10° 
rise in baking temperature. The mean square for deviations is not
significant, though it is higher than expectation. There is no indication of
any interaction. It will be observed that the error (a) m.s. is about twice
as large as that for error (b).
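The linear component of the temperature sum of squares can be obtained from the temperature totals by the usual orthogonal contrast for equally spaced levels. In the sketch below the coefficients -5, -3, ..., 5 are the standard linear contrast for 6 levels; their use here, and the variable names, are assumptions about how the component was computed rather than a procedure stated in the text.

```python
temp_totals = [1259, 1348, 1414, 1448, 1613, 1591]   # totals of 45 cakes each
n_per_total = 45
coeffs = [-5, -3, -1, 1, 3, 5]     # linear contrast, 6 equally spaced levels

# Value of the contrast and its divisor.
contrast = sum(c * t for c, t in zip(coeffs, temp_totals))
sum_c2 = sum(c * c for c in coeffs)                  # 70

# Sum of squares for linear regression (1 d.f.).
ss_linear = contrast ** 2 / (n_per_total * sum_c2)

# One step of 10 degrees changes the coefficient by 2, so the slope per
# 10 degrees is twice the slope per unit of the coded variable.
slope_per_10_degrees = 2 * contrast / (n_per_total * sum_c2)

print(round(ss_linear), round(slope_per_10_degrees, 1))
```

The sum of squares for deviations from the line is then the temperature s.s. less this linear component, with 4 d.f.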

In table 7.7 are shown the treatment means, with the principal
standard errors as obtained from table 7.4.

TABLE 7.7 Breaking angle means (degrees)

Recipe          175°   185°   195°   205°   215°   225°   Recipe means

I               29.1   31.5   30.8   33.5   38.7   35.1       33.1
II              26.9   29.4   31.7   32.1   34.5   35.3       31.6
III             27.9   28.9   31.7   30.9   34.4   35.7       31.6

Temp. means     28.0   30.0   31.4   32.2   35.8   35.4       32.1

Standard error of difference between

Two recipe means:                          $\sqrt{2(42.8)/90} = 0.98$ (28 d.f.)
Two temperature means:                     $\sqrt{2(20.5)/45} = 0.95$ (210 d.f.)
Two temperature means for one recipe:      $\sqrt{2(20.5)/15} = 1.65$ (210 d.f.)
Two recipe means for a given temperature:  $\sqrt{2[5(20.5) + 42.8]/90} = 1.80$
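The four standard errors can be computed together from the two error mean squares. This is a sketch under the assumption that the fourth standard error has the weighted form used in the computation shown for the cake example; the function name and dictionary keys are illustrative.

```python
from math import sqrt


def split_plot_standard_errors(E_a, E_b, r, alpha, beta):
    """Standard errors of differences between two means (sub-unit basis).

    E_a, E_b: error mean squares (a) and (b); r: replicates;
    alpha, beta: numbers of levels of A (units) and B (sub-units).
    """
    return {
        "two A means": sqrt(2 * E_a / (r * beta)),
        "two B means": sqrt(2 * E_b / (r * alpha)),
        "two B means, same A": sqrt(2 * E_b / r),
        "two A means, same B": sqrt(2 * ((beta - 1) * E_b + E_a) / (r * beta)),
    }


# Cake experiment: 3 recipes (A), 6 temperatures (B), 15 replications.
se = split_plot_standard_errors(E_a=42.8, E_b=20.5, r=15, alpha=3, beta=6)
for comparison, value in se.items():
    print(comparison, round(value, 2))
```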


The 5% levels of t are 2.05 and 1.97, respectively, for 28 and 210 d.f.
Consequently, the 5% level for the last standard error above is

$$\frac{(5)(20.5)(1.97) + (42.8)(2.05)}{(5)(20.5) + (42.8)} = 1.99$$

This value always lies between the two individual t-values. In practice
it need rarely be calculated.


7.18 Missing Data. The formulae for inserting estimates of missing
values have been developed by Anderson (7.3). Suppose first that the
observation for a single sub-unit is missing, and that this sub-unit
receives the treatment combination $(a_ib_j)$. Let

U = total for the unit containing the missing observation.

$(A_iB_j)$ = total of all sub-units that receive the treatment combination
$(a_ib_j)$.

$(A_i)$ = total of all observations that receive the ith level of A.

Then the estimate to be inserted for the missing value is

$$y = \frac{rU + \beta(A_iB_j) - (A_i)}{(r - 1)(\beta - 1)}$$

Example. Suppose that the observation for recipe II, 195° temperature,
in replication 2 is missing. (Its actual value is 47.) Then

U = 280 - 47 = 233; $(A_2B_3)$ = 476 - 47 = 429;
$(A_2)$ = 2848 - 47 = 2801

$$y = \frac{15(233) + 6(429) - 2801}{(14)(5)} = \frac{3268}{70} = 47$$

By accident, the estimated value is the same as the actual value. If
several observations are missing, repeated use is made of the formula as
described in section 4.25.
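The missing-value formula is easily checked against the worked example. The sketch below is a direct transcription of the formula; the function and argument names are illustrative, and the totals passed in are those of the example, each excluding the missing observation.

```python
def missing_subunit_estimate(r, beta, U, AB_total, A_total):
    """Estimate for a single missing sub-unit observation.

    r:        number of replications
    beta:     number of sub-unit treatments
    U:        total of the unit containing the missing value
    AB_total: total of sub-units receiving the same treatment combination
    A_total:  total of observations receiving the same level of A
    (all totals exclude the missing observation)
    """
    return (r * U + beta * AB_total - A_total) / ((r - 1) * (beta - 1))


# Recipe II, 195 degrees, replication 2 treated as missing.
y = missing_subunit_estimate(r=15, beta=6, U=233, AB_total=429, A_total=2801)
print(round(y, 1))
```

When several values are missing, the same function could be applied repeatedly, each time using the current estimates of the other missing values, until the estimates cease to change.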

In the analysis of variance 1 d.f. is subtracted from the error (b) d.f.
for each missing observation. An unbiased estimate of $E_b$ is obtained,
but the treatment sums of squares and that for error (a) are biased up- 
wards. If only a small fraction of the values is missing, it appears that 
these biases can be ignored; methods for obtaining unbiased estimates are 
given by Anderson (7.3). 

Standard errors. When there are missing observations, the formulae
in table 7.8 are suggested for the differences between 2 means.

TABLE 7.8 Standard errors for split-plot experiments with
missing data

Treatment comparison                            s.e.

Difference between two A means:                 $\sqrt{2[E_a + fE_b]/r\beta}$

Difference between two B means:                 $\sqrt{2E_b[1 + (f\beta/\alpha)]/r\alpha}$

Difference between two B means at the same
level of A:                                     $\sqrt{2E_b[1 + (f\beta/\alpha)]/r}$

Difference between two A means at the same 
level of B: V(2E a /rp) + MFV+M 

If only one value is missing in the experiment, and if a mean containing
that value is compared with another mean, the factor f is
$1/[2(r - 1)(\beta - 1)]$, and the formulae are exact, reference (7.3).

With more than 1 missing observation, the value of f depends on the
locations of the missing observations. The following approximation,
developed by G. S. Watson, is correct for a number of cases but
otherwise tends to be slightly too high.

$$f = \frac{k}{2(r - d)(\beta - k + c - 1)}$$

When counting the values of k, c, and d, as defined below, be sure to
ignore all missing observations except those that occur in the 2 means
that are being compared.

k = number of missing observations.

c = number of replications which contain 1 or more missing
observations.

d = number of missing observations in the sub-unit treatment $(a_ib_j)$
that is most affected.

For example, suppose that in the cake experiment the values for recipe 
I, 175° temperature, are missing in replications 1, 2, and 3, while those 
for recipe I, 185° temperature, are missing in replications 3 and 4. We
are comparing the means for the 2 temperatures, taken over all recipes. 
Then k is 5, and c is 4. The sub-unit treatment with the most missing 
values is recipe I, 175° temperature, which has 3 missing, so that d is 3. 
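The factor f for this example can be evaluated as follows. The sketch assumes the approximation has the form $f = k/[2(r - d)(\beta - k + c - 1)]$, which reduces to $1/[2(r - 1)(\beta - 1)]$ when a single value is missing; the function name is illustrative.

```python
def watson_f(k, c, d, r, beta):
    """Approximate factor f for standard errors with missing data.

    k: missing observations among the two means being compared
    c: replications containing one or more of those missing observations
    d: missing observations in the most affected sub-unit treatment
    r: replications; beta: number of sub-unit treatments
    """
    return k / (2 * (r - d) * (beta - k + c - 1))


# Single missing value in the cake experiment (r = 15, beta = 6).
f_single = watson_f(k=1, c=1, d=1, r=15, beta=6)

# The example above: k = 5, c = 4, d = 3.
f_example = watson_f(k=5, c=4, d=3, r=15, beta=6)

print(f_single, f_example)
```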

The case where a complete unit is missing is discussed by Anderson 
(7.3). 

7.2 Repeated Subdivision 

7.21 Description. In order to include a new factor C at $\gamma$ levels,
the sub-units may each be divided into sub-sub-units. There are three
experimental error variances. Errors (a) and (b) have the same functions
as described previously and are calculated in the same manner, except
that an extra divisor $\gamma$ is introduced into all sums of squares in order to
present the analysis on a sub-sub-unit basis. Error (c), which will usually
be the smallest of the three, applies to the C, AC, BC, and ABC effects.
The process of subdivision may be carried as far as is convenient.

The partition of degrees of freedom has already been given (table 7.3)
for factors A and B. For C and its interactions, the partition is shown in
table 7.9.

TABLE 7.9 Analysis of variance for C and its interactions

Effect          d.f.

C               $(\gamma - 1)$
AC              $(\alpha - 1)(\gamma - 1)$
BC              $(\beta - 1)(\gamma - 1)$
ABC             $(\alpha - 1)(\beta - 1)(\gamma - 1)$
Error (c)       $\alpha\beta(r - 1)(\gamma - 1)$

The computations in the analysis of variance are a straightforward
extension of those for the simple split-plot design. The sum of squares
for AC, for instance, is obtained by calculating the total s.s. for the A
by C two-way table, and subtracting the sums of squares due to A and
C; and similarly for the other factorial effects in table 7.9. Finally, the
sum of squares for error (c) is found by calculating the total s.s. among
sub-sub-units, and subtracting the contributions from all other items in
the analysis. Worked examples are given in references (7.4) and (7.5).
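The degrees of freedom for C and its interactions should add up to the number of degrees of freedom among sub-sub-units within sub-units, $r\alpha\beta(\gamma - 1)$. The sketch below evaluates the entries and makes that check; the function name and the numbers of levels used in the illustration are hypothetical.

```python
def c_effects_df(alpha, beta, gamma, r):
    """Degrees of freedom for C, its interactions, and error (c)."""
    g = gamma - 1
    return {
        "C": g,
        "AC": (alpha - 1) * g,
        "BC": (beta - 1) * g,
        "ABC": (alpha - 1) * (beta - 1) * g,
        "Error (c)": alpha * beta * (r - 1) * g,
    }


# Hypothetical experiment: 3 x 4 x 2 factorial in 5 replications.
df = c_effects_df(alpha=3, beta=4, gamma=2, r=5)
total_within_sub_units = 5 * 3 * 4 * (2 - 1)
print(df, sum(df.values()) == total_within_sub_units)
```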

7.22 Standard Errors. The formulae in table 7.4 for standard errors
applicable to A and B effects remain valid apart from division by an
additional factor $\sqrt{\gamma}$. For the principal comparisons which involve C
effects, standard errors are shown in table 7.10. All treatment means and
error mean squares are assumed to be on a sub-sub-unit basis.

TABLE 7.10 Standard errors for the split-plot design with two
subdivisions

Treatment comparison              s.e.

$[(c_1) - (c_0)]$                   $\sqrt{2E_c/r\alpha\beta}$
$[(a_1c_1) - (a_1c_0)]$             $\sqrt{2E_c/r\beta}$
$[(b_1c_1) - (b_1c_0)]$             $\sqrt{2E_c/r\alpha}$
$[(a_1c_1) - (a_0c_1)]$             $\sqrt{2[(\gamma - 1)E_c + E_a]/r\beta\gamma}$
$[(b_1c_1) - (b_0c_1)]$             $\sqrt{2[(\gamma - 1)E_c + E_b]/r\alpha\gamma}$


7.3 Some Variants of the Split-plot Design 

7.31 Systematic Arrangement of the Treatments Applied to the Units. 
The device of subdividing each unit into a number of sub-units is very 
flexible, and can be used with any type of design in which the units are 
arranged. Some of the more useful variations that are possible are de- 
scribed in the remainder of this chapter. The first arises when the nature 
of the experiment makes it necessary or desirable to have the A treat- 
ments arranged in a systematic design. This arrangement has been 
used in experiments where the A treatments were varieties of wheat
known to mature at different dates, and the B treatments were
preparations applied to the seed before sowing. Since each variety was to be
harvested when ripe, the field operations were made easier by planting
the varieties in the order in which they would be ready for harvest. The
field plan is illustrated in table 7.11, where $a_1$ is the earliest variety
and $a_5$ the latest.

TABLE 7.11 Example of systematic arrangement of treatments
applied to units

$a_1$   $a_2$   $a_3$   $a_4$   $a_5$

$a_1$   $a_2$   $a_3$   $a_4$   $a_5$

$a_1$   $a_2$   $a_3$   $a_4$   $a_5$



The sub-unit treatments are randomized within each unit; the unit 
treatments, however, appear in separate strips. 

This arrangement provides no valid estimate of error for the A main
effects, or for comparisons such as $(a_1b_0 - a_0b_0)$ that involve A effects.
Thus the “whole-plot” analysis of variance is irrelevant. In the sub-plot
analysis of variance, error (b) is still valid for testing the B effects and
the AB interactions. One point should be watched. A comparison such
as $(a_1b_1 - a_1b_0)$ is derived entirely from the first strip in the plan.
Consequently, if the sub-plot error variance differs from strip to strip, the
use of the error (b) m.s. for testing this quantity is not justified; instead,
it is necessary to compute a separate error using only data from the first
strip.

To summarize, systematic placement of the A treatments is advisable 
only where it is essential in order to conduct the experiment and where 
no test of the A main effects is required. 

7.32 Sub-unit Treatments in Strips. In a further variant the sub-unit
treatments, instead of being randomized independently within each
unit, are arranged in strips across each replication. For a 5 × 3 design
the appropriate rearrangement (after randomization) might be as shown
in table 7.12.

This layout may be convenient for field experiments where it is
necessary to test both factors on relatively large areas and to leave free
access at both ends. Similar conditions may apply in other types of
experimentation.

TABLE 7.12 Design with sub-unit treatments in strips

[Field plan: in each replication the A treatments occupy strips in one
direction and the B treatments occupy strips running across them.]

Analysis of variance
                    d.f.

Replications          2
A                     4
Error (a)             8
B                     2
Error (b)             4
AB                    8
Error (c)            16

Total                44

As with the ordinary split-plot design, the average precision over all
treatment comparisons is the same as that of randomized blocks arranged
on the sub-units. The present design sacrifices precision on the main
effects of A and B in order to provide higher precision on the interactions,
which will generally be more accurately determined than in either
randomized blocks or the simple split-plot design. Since in addition the
numbers of degrees of freedom for estimating the errors of the A and B
effects are likely to be small, the design is not recommended unless
practical considerations necessitate its use or unless the interactions are the
principal object of study. Both the A and B treatments should be
randomized independently within each replication.
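The degrees of freedom for the design with both factors in strips can be generated for any numbers of levels. The general expressions in the sketch below are an assumption, inferred from treating each set of main effects as a randomized-blocks analysis and the interaction error as the remainder; they are checked only against the 5 × 3 design in 3 replications. The function name is illustrative.

```python
def strip_design_df(alpha, beta, r):
    """Degrees of freedom when A and B are both applied in strips
    across each of r replications (assumed general expressions)."""
    return {
        "Replications": r - 1,
        "A": alpha - 1,
        "Error (a)": (r - 1) * (alpha - 1),
        "B": beta - 1,
        "Error (b)": (r - 1) * (beta - 1),
        "AB": (alpha - 1) * (beta - 1),
        "Error (c)": (r - 1) * (alpha - 1) * (beta - 1),
    }


# The 5 x 3 design in 3 replications.
df = strip_design_df(alpha=5, beta=3, r=3)
print(df, sum(df.values()))
```

The small numbers of degrees of freedom for errors (a) and (b) in this example illustrate why the main effects are poorly tested in such a layout.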


TABLE 7.13 Latin square design with sub-unit treatments in strips

[Field plan: the A treatments are arranged in a 5 × 5 latin square, with
the B treatments in strips across each row.]

Analysis of variance
                    d.f.

Rows                  4
Columns               4
A                     4
Error (a)            12
B                     2
Error (b)             8
AB                    8
Error (c)            32

Total                74

In the statistical analysis, separate estimates of error are obtained for
A, B, and AB. Both sets of main effects may be regarded as tested in
randomized block designs, in which the sum of squares for replications
(blocks) is the same for both designs. A worked example (on uniformity
data) is given in reference (7.6).

Formulae for the estimated standard errors of the differences between
treatment means per sub-unit are given in table 7.14. The mean squares
$E_a$, $E_b$, and $E_c$ are assumed to be on a sub-unit basis.


TABLE 7.14 Standard errors when sub-unit treatments are in strips
($\alpha$ = number of levels of A; $\beta$ = number of levels of B; r = number of replicates)

Treatment comparison              s.e.

$[(a_1) - (a_0)]$                   $\sqrt{2E_a/r\beta}$
$[(b_1) - (b_0)]$                   $\sqrt{2E_b/r\alpha}$
$[(a_1b_1) - (a_0b_1)]$             $\sqrt{2[(\beta - 1)E_c + E_a]/r\beta}$
$[(a_1b_1) - (a_1b_0)]$             $\sqrt{2[(\alpha - 1)E_c + E_b]/r\alpha}$


One of the factors may be arranged in a latin square. With a 5 X 3 
design in 5 replicates, a layout of this type is shown in table 7.13. 

The statistical analysis is straightforward. For the B treatments, the 
“blocks” of the randomized blocks design are the rows of the latin square. 
The formulae given above for the standard errors are applicable (with r 
equal to a). 


7.33 Sub-unit Treatments in a Latin Square. In certain cases the 
sub-unit treatments may be arranged in a latin square, with the prospect 
of a further increase in accuracy on sub-unit comparisons. There are 
two types of layout, of which the first will be described in this section; 
the second, which is used when there are only a few whole-unit treat- 
ments whose main effects are not required with high accuracy, is dis- 
cussed in section 8.6. 

In the first type the number of replications must be a multiple of the 
number of sub-unit treatments. For each whole-unit treatment, the 
sub-unit treatments are arranged in one or more separate latin squares. 
An example with 4 whole-unit treatments, 3 sub-unit treatments, and 3 
replications is shown in table 7.15. The latin squares are not very clearly 
displayed in the plan. If we examine the three units that receive $a_3$, it is
seen that each b appears once in the top sub-unit, once in the middle,
and once at the bottom. Thus the rows of the latin square are the
positions of the sub-units within the unit. The only change from the
ordinary split-plot arrangement is that differences among these rows are
eliminated from the sub-unit errors. If those differences are large,
sub-unit comparisons are more precisely determined than in the ordinary
design. Whole-unit comparisons are unaffected.

The designs are useful where a fairly consistent gradient is expected
within each unit. Examples have been quoted previously. In
experiments on plant viruses, the sub-units may be the top, middle, and lower
leaves of a plant, with a regular gradient of susceptibility down the
plant. The sub-unit treatments may be operations performed in
succession by the same person, or drugs injected in succession to the same
animal, where a time-trend is anticipated.

TABLE 7.15 Split-plot design with sub-treatments in a latin square

[Field plan: 3 replications of 4 units, each unit divided into top,
middle, and bottom sub-units.]

Analysis of variance
Units               d.f.

Replications          2
A                     3
Error (a)             6

  Total              11

Sub-units

Rows                  8
B                     2
AB                    6
Error (b)             8

  Total              24

The number of replicates may be any multiple of the number of sub- 
unit treatments. With 6 replicates of the design in table 7.15, the ex- 
perimental material is divided into 2 groups of 3 replicates each. If 
practicable, the division should be made in such a way that the within- 
unit gradient is constant in a given group. Since the rows are eliminated 
separately for each group, changes in the gradient from group to group 
do not increase the error. 

In the randomization the first step is to randomize the units according
to the instructions for the design in which they are arranged. For each
whole-unit treatment a latin square of the size required to accommodate
the sub-treatments is then randomized as described in section 4.33. A
new randomization of the latin square is taken for each successive
whole-unit treatment. The first column of each square is assigned to the first
replication of the experiment, and so on.

For extra replication the basic plan is repeated with a new randomiza- 
tion. 

Since the whole-unit analysis of variance is unaffected, it is necessary
to show only the subdivision of the degrees of freedom among sub-units.
Table 7.16 applies to the case in which the basic plan is repeated k times.

TABLE 7.16 Partition of degrees of freedom among sub-units when
sub-treatments form latin squares

                                        d.f.

Rows                                    $k\alpha(\beta - 1)$
Sub-unit treatments                     $(\beta - 1)$
Sub-unit × whole-unit treatments        $(\alpha - 1)(\beta - 1)$
Error (b)                               $\alpha(\beta - 1)(r - 1 - k)$

  Total                                 $r\alpha(\beta - 1)$

Notation: $\alpha$ = number of whole-unit treatments.
          $\beta$ = number of sub-unit treatments.
          r = number of replications = $k\beta$.


The number of degrees of freedom for rows should be noted. Since
each latin square supplies $(\beta - 1)$ degrees of freedom for rows, every
whole-unit treatment contributes $k(\beta - 1)$ degrees of freedom, so that
the total is $k\alpha(\beta - 1)$ degrees of freedom. The sum of squares for rows
is calculated by addition of the sums of squares for each of the $k\alpha$ latin
squares. The sum of squares among the columns of the latin squares
constitutes a whole-unit comparison and does not appear in the portion 
of the analysis shown above. 

Before using this design it is advisable to verify that the number of 
degrees of freedom for error ( b ) is adequate. In the example in table 7.15 
there are only 8 d.f., as against 16 with an ordinary split-plot
arrangement. From section 2.31 it appears that the sub-unit error variance
must be reduced by at least 15% in order to compensate for the decrease 
in degrees of freedom. 
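
The degrees of freedom quoted above can be checked with a short calculation. The following Python sketch is illustrative only: the function names are hypothetical, and the values α = 4, β = 3, k = 1 are an assumption, chosen because they reproduce the 8 and 16 d.f. mentioned in the text (table 7.15 itself is not reproduced here).

```python
def latin_subunit_df(alpha, beta, k):
    """Sub-unit d.f. when the sub-treatments form latin squares.

    alpha = whole-unit treatments, beta = sub-unit treatments,
    k = number of times the basic plan is repeated (r = k * beta).
    """
    r = k * beta
    parts = {
        "rows": k * alpha * (beta - 1),
        "sub-unit treatments": beta - 1,
        "sub-unit x whole-unit treatments": (alpha - 1) * (beta - 1),
        "error (b)": alpha * (beta - 1) * (r - 1 - k),
    }
    total = r * alpha * (beta - 1)
    # The four components must add up to the sub-unit total.
    assert sum(parts.values()) == total
    return parts, total


def ordinary_split_plot_error_b_df(alpha, beta, r):
    """Error (b) d.f. for an ordinary split-plot arrangement in r replications."""
    return alpha * (beta - 1) * (r - 1)


# Assumed dimensions (not taken from the text): alpha = 4, beta = 3, k = 1.
parts, total = latin_subunit_df(alpha=4, beta=3, k=1)
ordinary = ordinary_split_plot_error_b_df(alpha=4, beta=3, r=3)
print(parts["error (b)"], ordinary)
```

With these assumed dimensions the latin-square arrangement leaves half as many error (b) degrees of freedom as the ordinary arrangement, which is the loss the text asks the experimenter to weigh.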


7.34 Split-plot Technique with Confounded Designs. The treatments
applied to the units may constitute a factorial system arranged in
a design that confounds certain interactions completely or partially.
Since experimenters are sometimes uncertain how to compute the analysis
of variance in such cases, the following notes indicate the method.

1. The totals for each unit are analyzed by the instructions for the
confounded design in which the units are arranged.

2. In computing the sums of squares for the sub-unit treatments and
for all interactions between sub-unit and whole-unit treatments, the
confounding is ignored. Even if a whole-unit factorial effect is
completely confounded, its interactions with sub-unit treatments are
unconfounded and may be tested by means of error (b).

3. The sum of squares for error (b) is found as usual by subtracting the
total s.s. on the whole units plus the total s.s. for sub-unit treatments
and for interactions between sub-unit and whole-unit treatments from
the total s.s. for sub-units.

A difficulty arises occasionally in the presentation of tables. Suppose
that a 3 × 3 × 2 design is applied to the units in blocks of 6 units, with
AB and ABC partially confounded, and that a factor D at 2 levels is
applied to the sub-units. In the A by B two-way table, the entries must be
adjusted to eliminate block effects. The problem is how to present the
ABD three-way table. The observed means (a₀b₀d₀) and (a₀b₀d₁) will
average to the unadjusted mean for (a₀b₀), which contains block effects.
The simplest process is to calculate the difference between the adjusted
and the unadjusted means of (a₀b₀). This difference is then applied to
(a₀b₀d₀) and (a₀b₀d₁). Notice that this adjustment does not change the
difference between these two entries and brings their mean to the correct
value.

7.35 Confounding of Comparisons among Sub-unit Treatments.

When the sub-unit treatments are factorial, it may be advantageous to
confound certain interactions among these factors with the units. The
technique, first presented by Yates (7.7), is illustrated in table 7.17,
which shows a single replication of the design. The factor A applied to
the units has 2 levels, while the sub-unit treatments are the 8
combinations of factors B, C, and D at 2 levels each. The ordinary
split-plot arrangement is shown in (i), while (ii) and (iii) are two
alternatives for the new method.

Consider first (i) and (ii). In the ordinary arrangement the unit
contains 8 sub-units, to which the 8 sub-treatments are allotted at random.
With the new arrangement the sub-units are in groups of 4, chosen so
that BCD is confounded with group totals. The advantage of the new
layout is that the size of the whole-unit has been reduced from 8 to 4
sub-units. This will in general lead to more accurate estimates both of the
A effects and of the sub-unit treatments. The only exceptions are the
high-order interactions BCD and ABCD. In the ordinary arrangement
these interactions are derived from comparisons among sub-units in the
same unit, whereas in the new design they are estimated from
comparisons among the units, as the reader may verify.
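
The verification can be sketched in a few lines of Python. This is an illustration under stated assumptions, not part of the original account: the helper `sign` is a hypothetical name, and the two groups of 4 sub-treatments are assumed to be (1), bc, bd, cd and b, c, d, bcd, the two halves of the 2³ system that differ in the sign of BCD.

```python
def sign(effect, treatment):
    """Sign (+1 or -1) with which a treatment combination enters an effect.

    A combination enters positively when the number of letters of the
    effect that it lacks is even, e.g. AB = (1) + ab - a - b.
    """
    letters = "" if treatment == "(1)" else treatment
    shared = sum(1 for f in effect.lower() if f in letters)
    return 1 if (len(effect) - shared) % 2 == 0 else -1


# Assumed grouping of the 8 sub-treatments into two units of 4 sub-units.
group_1 = ["(1)", "bc", "bd", "cd"]
group_2 = ["b", "c", "d", "bcd"]

# BCD has a constant sign within each group, so it is a comparison
# between groups (units), i.e. it is confounded with group totals.
bcd_signs_1 = {sign("BCD", t) for t in group_1}
bcd_signs_2 = {sign("BCD", t) for t in group_2}

# Every other sub-treatment effect is balanced (sums to zero) within each
# group, so it is still estimated from comparisons inside the units.
others = ["B", "C", "D", "BC", "BD", "CD"]
balanced = all(
    sum(sign(e, t) for t in g) == 0 for e in others for g in (group_1, group_2)
)
print(bcd_signs_1, bcd_signs_2, balanced)
```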

Arrangement (iii) goes a step further. The a₀ and a₁ treatments which
have opposite sets of sub-unit treatments are placed in the same
sub-block. The result is that ABCD is completely confounded between
sub-blocks, while BCD, like A, is orthogonal with sub-blocks. The aim is to
sacrifice information on ABCD in the hope of obtaining more precise
estimates of A and BCD. Consequently, alternative (iii) is usually
preferable to (ii).


TABLE 7.17 Confounding of BCD in a split-plot experiment

i. Ordinary arrangement: units a₀ and a₁, each of 8 sub-units.
ii. New arrangement: units a₀, a₀, a₁, a₁, each of 4 sub-units.
iii. New arrangement in sub-blocks: Block Ia (a₀, a₁) and Block Ib (a₀, a₁).

With 5 replications of arrangement (iii), the partition of degrees of
freedom is set out in table 7.18 in condensed form. There are 80
sub-units and 10 sub-blocks.

TABLE 7.18 Partition of degrees of freedom in a 2 × 2³ design with

Units                     d.f.    Sub-units                        d.f.
Between sub-blocks          9     B, C, D, BC, BD, CD                6
A                           1     AB, AC, AD, ABC, ABD, ACD          6
BCD                         1     Error (b)                         48
Error (a)                   8

Total between units        19     Grand total between sub-units     79














All sums of squares are calculated in the usual way. Note that ABCD
does not appear, since it is a component of the sum of squares between
sub-blocks.

When the sub-unit treatments form a 2³ system, a similar technique
may be used for any number of levels of the factor A. Single replications
of the layouts are shown in table 7.19 for A at 2, 3, 4, 5, and 6 levels.


TABLE 7.19 Split-plot design for an α × 2³ factorial system
Arrangement of whole-units in a single replication

Panels: α = 2 (2 × 2³); α = 3 (3 × 2³); α = 4 (4 × 2³); α = 5 (5 × 2³);
α = 6 (6 × 2³). Each panel is divided into sub-blocks Ia and Ib.

Each unit contains 4 sub-units. A prime (′) implies that the sub-unit treatments
are (1), bc, bd, and cd, and an unaccented a that they are b, c, d, and bcd.

The arrangements differ slightly according as the number of levels of
A is even or odd. With an even number of levels, one component of ABCD
is confounded between sub-blocks. For example, with A at 4 levels, this
component is (a₃ + a₁ − a₂ − a₀)BCD. When A has an odd number
of levels, we are forced to confound BCD at least partially with
sub-blocks, and for simplicity it has been completely confounded.

The randomization for a single replicate is as follows. (i) Allot the
sub-blocks a and b at random to the 2 halves into which the replicate has
been divided. (ii) Allot the levels of A at random to units within each
sub-block. (iii) Allot the appropriate set of 4 sub-treatments at random
within each unit. A separate randomization is used in each replication.

The partition of degrees of freedom in the analysis of variance is shown 
in table 7.20 for r replications of the basic design. 

TABLE 7.20 Partition of degrees of freedom for an α × 2³ split-plot
DESIGN

Units               α even d.f.          α odd d.f.
Sub-blocks          (2r − 1)             (2r − 1)
A                   (α − 1)              (α − 1)
BCD                 1
ABCD                (α − 2)              (α − 1)
Error (a)           2(α − 1)(r − 1)      2(α − 1)(r − 1)

Total               (2αr − 1)            (2αr − 1)

Sub-units                 d.f.
Sub-treatments            6
A × sub-treatments        6(α − 1)
Error (b)                 6α(r − 1)

Total                     6αr


When α is odd, the sum of squares for BCD is omitted from the
treatments; otherwise all sums of squares are obtained in the usual way.
With α even, 1 d.f. is missing from ABCD, since it is confounded with
sub-blocks. Since these designs are unlikely to be used unless ABCD is
small, it will usually be satisfactory to combine the ABCD sum of
squares with error (a). The pooled sum of squares can be found by
subtraction without calculating that for ABCD.
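
The partition in table 7.20 can be checked arithmetically. The Python sketch below is only an illustration (the function name is hypothetical); it encodes the even and odd cases as described and confirms that the case α = 2, r = 5 agrees with the figures of table 7.18.

```python
def split_plot_2cubed_df(alpha, r):
    """Degrees of freedom for an alpha x 2^3 split-plot design, r replications.

    Returns (units, sub_units) as dictionaries of d.f.
    """
    odd = alpha % 2 == 1
    units = {
        "sub-blocks": 2 * r - 1,
        "A": alpha - 1,
        # With alpha odd, BCD is completely confounded with sub-blocks.
        "BCD": 0 if odd else 1,
        # With alpha even, 1 d.f. of ABCD is confounded with sub-blocks.
        "ABCD": alpha - 1 if odd else alpha - 2,
        "error (a)": 2 * (alpha - 1) * (r - 1),
    }
    sub_units = {
        "sub-treatments": 6,
        "A x sub-treatments": 6 * (alpha - 1),
        "error (b)": 6 * alpha * (r - 1),
    }
    assert sum(units.values()) == 2 * alpha * r - 1
    assert sum(sub_units.values()) == 6 * alpha * r
    return units, sub_units


units, sub_units = split_plot_2cubed_df(alpha=2, r=5)
grand_total = sum(units.values()) + sum(sub_units.values())
print(units, sub_units, grand_total)
```

Pooling ABCD with error (a), as suggested above, amounts to adding the two corresponding entries of `units`.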

A considerable number of arrangements of this type are possible,
though some care is required in their construction in order to avoid
confounding important comparisons. Finney (7.8) gives an account of
the principles of construction and designs with worked examples, for a
4 × 2² and a 6 × 2² factorial, where the units are in a 4 × 4 and a 6 × 6
latin square, respectively. Yates (7.7) gives a similar design for a 6 × 2³
factorial.
CHAPTER 8


FACTORIAL EXPERIMENTS CONFOUNDED IN QUASI-LATIN 

SQUARES 


8.1 Introduction 

Yates (8.1) has constructed a number of confounded arrangements for
factorial designs in latin squares. In the first group of designs (plans
8.1-8.7), some of the interactions among the factors are confounded with
the rows and columns of the squares. These designs resemble ordinary
latin squares in that the differences among the rows and columns of the
squares are eliminated from the experimental errors of the unconfounded
treatment effects; however, the designs do not possess the typical latin
square property that each treatment appears once in every row and every
column. Yates (8.1) has proposed the name quasi-latin squares.

Before one of these plans is used, it is advisable to note the interactions 
that are confounded and the extent to which they are confounded, par- 
ticularly since more treatment effects must be confounded with latin 
squares than with randomized blocks. Of the eight plans shown, only 
one involves the confounding of two-factor interactions. 

In the statistical analysis, the sums of squares for rows, columns, and 
all unconfounded treatment effects are calculated in the usual way. 
Partially confounded effects, if considered negligible, may be combined 
with the error, which is found by subtraction; or they may be tested for 
significance by calculating their sums of squares from those portions of 
the experiment in which the effects are unconfounded. 

In a further group of designs (plans 8.8-8.14), certain treatments are 
applied to complete rows or columns of a latin square. These designs 
have been called half-plaid squares or plaid squares, from the resemblance 
of the plan to a Scotch plaid. 

8.2 Randomization of Quasi-latin Squares 

The rows and the columns of each latin square are rearranged at
random. In certain plans (e.g., 8.2) each square contains several
replications. The randomization must not be restricted in such cases with
the object of keeping the replications separate.
8.3 Notes on the Plans and Statistical Analysis

2³ factorial in a 4 × 4 square. There are a number of alternative
designs, of which two are given in plans 8.1a and 8.1b. In the former, which
requires 4 replications, 1/2 relative information is retained on AB, AC,
BC, and ABC, main effects being unconfounded. On account of the high
degree of confounding of the two-factor interactions, this plan is not
recommended unless the factors are presumed to act independently or
unless the 4 × 4 square is considered much more accurate than
alternative designs.

A numerical example illustrates the general method of analysis for all
designs presented here. The data in table 8.1 represent twice the yields
in bushels per acre from a soil-treatment test on soybeans, at Muscatine,
Iowa. The plots were split for a varietal comparison which has been
omitted. For convenience, rows and columns were rearranged so that
the field plan corresponds to plan 8.1a. The factors were limestone (A),
phosphorus (B), and potash (C); in each case the lower level implies no
application of the treatment.


TABLE 8.1 Yields and statistical analysis of a 2³ factorial in two
4 × 4 QUASI-LATIN SQUARES

                         Square I
                 ABC                   AC
          c       (1)       abc      ab       Totals
  AB     57.9     62.3     60.2     58.6      239.0
          a       ac        b       bc
         66.2     60.5     60.0     53.4      240.1

          abc     bc       (1)       a
  BC     64.1     61.3     63.2     59.2      247.8
          b       ab       ac        c
         58.4     61.8     67.0     61.9      249.1

Totals  246.6    245.9    250.4    233.1      976.0

                         Square II
                 AB                    BC
          c        a        abc      b        Totals
  ABC    62.4     64.6     68.5     63.1      258.6
         (1)      ac       bc       ab
         61.5     62.7     58.6     69.3      252.1

          abc      b       (1)      ac
  AC     66.0     56.5     63.5     67.3      253.3
          ab      bc        a        c
         65.9     52.6     58.1     58.1      234.7

Totals  255.8    236.4    248.7    257.8      998.7

              d.f.      s.s.      m.s.
Squares         1       16.10
Rows            6      100.90     16.82
Columns         6      112.71     18.78
A, B, C         3      146.43
AB              1′      18.92     18.92
AC              1′       0.06      0.06
BC              1′       0.33      0.33
ABC             1′       8.27      8.27
Error          11      101.86      9.26

Total          31      505.58

To obtain the sums of squares for the treatment effects, first calculate
the factorial effect totals in the usual way. These are shown in the first
line of table 8.2. For the partially confounded effects, these totals must
be adjusted.


TABLE 8.2 Treatment effects for a 2³ factorial in 4 × 4
QUASI-LATIN SQUARES

                         [A]     [B]     [C]     [AB]    [AC]    [BC]    [ABC]
Totals                  +65.3   −18.1    −9.7   +35.7   +34.9    −8.1    −4.3
From rows and columns                           +18.3   +35.9   −10.4    +7.2
Adjusted totals                                 +17.4    −1.0    +2.3   −11.5


Now AB is completely confounded with the first 2 rows of
square I and the first 2 columns of square II. Calculate AB as obtained
from these rows and columns, i.e.,

239.0 − 240.1 + 255.8 − 236.4 = +18.3

This figure is placed in the second row of table 8.2, with corresponding
figures for AC, BC, and ABC. The adjusted totals are found by
subtracting the second from the first row. Of course, these represent just
the totals for each effect as found from that part of the experiment in
which the effect is unconfounded. The divisors for the squares are 32
for main effects and 16 for the other effects.
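
The computation just described can be sketched in Python. This is an illustration rather than a prescribed procedure: the function names are hypothetical, and the yields are entered as read from table 8.1 (they reproduce the row, column, and effect totals quoted in the text). The confounded rows and columns are detected as those in which an effect has a constant sign.

```python
# (treatment, yield) by rows of each 4 x 4 square, as read from table 8.1.
SQUARE_I = [
    [("c", 57.9), ("(1)", 62.3), ("abc", 60.2), ("ab", 58.6)],
    [("a", 66.2), ("ac", 60.5), ("b", 60.0), ("bc", 53.4)],
    [("abc", 64.1), ("bc", 61.3), ("(1)", 63.2), ("a", 59.2)],
    [("b", 58.4), ("ab", 61.8), ("ac", 67.0), ("c", 61.9)],
]
SQUARE_II = [
    [("c", 62.4), ("a", 64.6), ("abc", 68.5), ("b", 63.1)],
    [("(1)", 61.5), ("ac", 62.7), ("bc", 58.6), ("ab", 69.3)],
    [("abc", 66.0), ("b", 56.5), ("(1)", 63.5), ("ac", 67.3)],
    [("ab", 65.9), ("bc", 52.6), ("a", 58.1), ("c", 58.1)],
]


def sign(effect, treatment):
    """Sign with which a treatment combination enters a factorial effect."""
    letters = "" if treatment == "(1)" else treatment
    shared = sum(1 for f in effect.lower() if f in letters)
    return 1 if (len(effect) - shared) % 2 == 0 else -1


def constant_lines(effect, square):
    """Rows and columns of a square in which the effect has a constant sign."""
    rows = [i for i in range(4)
            if len({sign(effect, square[i][j][0]) for j in range(4)}) == 1]
    cols = [j for j in range(4)
            if len({sign(effect, square[i][j][0]) for i in range(4)}) == 1]
    return rows, cols


def effect_totals(effect):
    """(total, part from confounded rows and columns, adjusted total)."""
    total = confounded = 0.0
    for square in (SQUARE_I, SQUARE_II):
        rows, cols = constant_lines(effect, square)
        for i in range(4):
            for j in range(4):
                treatment, y = square[i][j]
                contribution = sign(effect, treatment) * y
                total += contribution
                if i in rows or j in cols:
                    confounded += contribution
    return round(total, 1), round(confounded, 1), round(total - confounded, 1)


# Sums of squares: divisor 16 for the partially confounded interactions,
# 32 for the unconfounded main effects.
interaction_ss = {e: round(effect_totals(e)[2] ** 2 / 16, 2)
                  for e in ("AB", "AC", "BC", "ABC")}
main_ss = round(sum(effect_totals(e)[0] ** 2 for e in "ABC") / 32, 2)
print(interaction_ss, main_ss)
```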

The two-way tables of means require adjustment. For the AB table
the adjustment is

(+17.4)/16 − (+35.7)/32 = −0.03

This value is added (algebraically) to the (1) and (ab) means and
subtracted from the (a) and (b) means.
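
A small Python sketch shows the effect of this adjustment on the AB table. It is illustrative only: the treatment totals are those obtained by summing the yields of table 8.1 over the four replications, and the variable names are hypothetical.

```python
# Treatment totals over the four replications, summed from table 8.1.
totals = {"(1)": 250.5, "a": 248.1, "b": 238.0, "ab": 255.6,
          "c": 240.3, "ac": 257.5, "bc": 225.9, "abc": 258.8}

# Unadjusted AB two-way means: each cell is the mean of 8 plots.
unadjusted = {
    "(1)": (totals["(1)"] + totals["c"]) / 8,
    "a": (totals["a"] + totals["ac"]) / 8,
    "b": (totals["b"] + totals["bc"]) / 8,
    "ab": (totals["ab"] + totals["abc"]) / 8,
}

adjustment = 17.4 / 16 - 35.7 / 32   # adjusted total / 16 - unadjusted total / 32

# Add to the (1) and (ab) means, subtract from the (a) and (b) means.
adjusted = {
    "(1)": unadjusted["(1)"] + adjustment,
    "ab": unadjusted["ab"] + adjustment,
    "a": unadjusted["a"] - adjustment,
    "b": unadjusted["b"] - adjustment,
}

# The interaction contrast of the adjusted means now equals the AB effect
# as estimated from the unconfounded part of the experiment (17.4 / 4).
contrast = adjusted["(1)"] + adjusted["ab"] - adjusted["a"] - adjusted["b"]
print(round(adjustment, 2), round(contrast, 2))
```

The marginal means of A and B are unchanged, since the adjustment enters each margin once with each sign.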

Plan 8.1b completely confounds ABC with squares; all other effects
are unconfounded. The arrangement has the peculiarity that half the
treatments appear only in square I, while the other half appear only in
square II. Although this layout avoids the confounding of any
two-factor interactions, Yates has pointed out that any interaction of the A
effect with squares will appear in the analysis as a BC interaction. The
results are therefore likely to be misleading if the experimental material
in square I differs in responsiveness from that in square II. Where this
danger seems likely, Yates suggests that the columns (or rows) of the
two squares may be interlaced, so that each square comprises a
randomly chosen half of the experimental material. A numerical example
of the statistical analysis is given by Yates (8.1, p. 83), using uniformity
data.

2⁴ factorial in an 8 × 8 square. Four replications are required for plan
8.2. ABCD is completely confounded, and threefold replication is
obtained on all three-factor interactions. The degrees of freedom
subdivide as follows.

                d.f.
Rows              7
Columns           7
Treatments       14
Error            35

In the computation for ABC, ABD, ACD, or BCD, the appropriate pair
of rows is deleted.
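
The pair of rows to be deleted for each three-factor interaction can be found mechanically. The Python sketch below is an illustration under assumptions: the layout is plan 8.2 as reconstructed from the printed plan, and the function names are hypothetical. A row or column is confounded with an effect when every treatment in it carries the same sign for that effect.

```python
from collections import Counter

# Plan 8.2 by rows (before randomization), as reconstructed from the plan.
PLAN_8_2 = [
    "c abcd b ad a bd abc cd",
    "abd (1) bcd bc acd ac d ab",
    "d bc a abcd b cd abd ac",
    "bcd ad acd bd abc ab c (1)",
    "a bd c ab d abcd acd bc",
    "abc ac abd cd bcd (1) b ad",
    "b ab d ac c ad bcd abcd",
    "acd cd abc (1) abd bc a bd",
]
GRID = [row.split() for row in PLAN_8_2]


def sign(effect, treatment):
    """Sign with which a treatment combination enters a factorial effect."""
    letters = "" if treatment == "(1)" else treatment
    shared = sum(1 for f in effect.lower() if f in letters)
    return 1 if (len(effect) - shared) % 2 == 0 else -1


def confounded_with(effect):
    """Rows and columns (numbered from 1) with a constant sign for the effect."""
    rows = [i + 1 for i, row in enumerate(GRID)
            if len({sign(effect, t) for t in row}) == 1]
    cols = [j + 1 for j in range(8)
            if len({sign(effect, GRID[i][j]) for i in range(8)}) == 1]
    return rows, cols


for effect in ("ABC", "ABD", "ACD", "BCD", "ABCD"):
    print(effect, confounded_with(effect))

# Each of the 16 treatment combinations occurs 4 times (4 replications).
counts = Counter(t for row in GRID for t in row)
```

In an actual experiment the rows and columns are randomized, so the rows to delete are those of the field layout that correspond to the rows found here.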

2⁵ factorial in an 8 × 8 square. The confounding in plan 8.3 (for 2
replications) is not completely balanced. Eight of the 10 three-factor
interactions and four of the 5 four-factor interactions receive only single
replication. The remaining high-order interactions, ABE, CDE, ABCD,
and ABCDE, are unconfounded. By a proper assignment of the letters
a, b, ... to the factors, any single three-factor interaction may be left
unconfounded, though not any desired pair of interactions.

The partition of degrees of freedom is:

                d.f.
Rows              7
Columns           7
Treatments       31
Error            18

In the computation of the sum of squares for ABC, ADE, or BCDE, the
4 rows in the experiment which correspond to the first 4 rows of plan 8.3
are omitted. The 4 rows or columns to be omitted for any other partially
confounded comparison are found similarly from plan 8.3. If the
high-order interaction effects are likely to be negligible, all partially
confounded effects and ABCD and ABCDE may be combined with the
error. This procedure gives 17 d.f. for treatments and 32 for error.
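
The counts of degrees of freedom quoted above can be reproduced by enumeration. The following Python sketch is illustrative only; the variable names are hypothetical, and the set of partially confounded interactions is taken from the labels on plan 8.3.

```python
from itertools import combinations

FACTORS = "ABCDE"
# All 31 factorial effects of the 2^5 system, main effects first.
effects = ["".join(c) for k in range(1, 6) for c in combinations(FACTORS, k)]

# Interactions confounded with rows or columns in one of the 2 replications.
partially_confounded = {
    "ACE", "BCD", "ABDE",   # columns 1-4
    "ACD", "BDE", "ABCE",   # columns 5-8
    "ABC", "ADE", "BCDE",   # rows 1-4
    "ABD", "BCE", "ACDE",   # rows 5-8
}

unconfounded_high_order = [e for e in effects
                           if len(e) >= 3 and e not in partially_confounded]

TOTAL_DF = 8 * 8 - 1
ROWS_DF = COLUMNS_DF = 7

# Full analysis: every effect receives 1 d.f.
full_error_df = TOTAL_DF - ROWS_DF - COLUMNS_DF - len(effects)

# Pooled analysis: partially confounded effects, ABCD, and ABCDE go to error.
kept = [e for e in effects
        if e not in partially_confounded and e not in ("ABCD", "ABCDE")]
pooled_error_df = TOTAL_DF - ROWS_DF - COLUMNS_DF - len(kept)
print(unconfounded_high_order, full_error_df, len(kept), pooled_error_df)
```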
2⁶ factorial in an 8 × 8 square. In each square of plan 8.4, 8
three-factor and 6 four-factor interactions are completely confounded. If the
experiment has a single replication and if only the main effects and
two-factor interactions are isolated for tests of significance, the degrees of
freedom are allotted as follows.

                                                   d.f.
Rows                                                 7
Columns                                              7
Main effects                                         6
Two-factor interactions                             15
Error (estimated from high-order interactions)      28

Any unconfounded three-factor interaction may also be tested.

When there are several replications in the experiment, the squares
should be chosen so that the most important three-factor interactions
are free from confounding. The balanced design, which requires 5
replications, gives 3/5 relative information on all three- and four-factor
interactions.

3³ factorial in a 9 × 9 square. Three or six replicates may be used
(plan 8.5). Four of the 8 d.f. for ABC are completely confounded in each
square. The 2 squares together give balanced confounding of ABC, with
1/2 relative information.

When square I alone is used, the components of the analysis of
variance are:

                d.f.
Rows              8
Columns           8
A, B, C           6
AB, AC, BC       12
ABC               4
Error            42

The 4 d.f. for ABC are evaluated as follows. Compute the total yields
of the 3 groups of 9 treatments which are described as set II (a), (b), and
(c) in square II. The sum of squares of deviations of these 3 totals, with
divisor 27, constitutes 2 of the 4 d.f. The remaining 2 are found similarly
from set IV (a), (b), and (c) in square II.

With both squares, all 8 d.f. for ABC may be computed, sets I and III
being taken from the experimental data for square II, and sets II and IV
from the data for square I.

3⁴ factorial in a 9 × 9 square. Each square in plan 8.6 provides 1/2
relative information on ABC, ABD, ACD, and BCD. For an experiment
with a single replication, the simplest subdivision of the degrees of
freedom is:

                                                   d.f.
Rows                                                 8
Columns                                              8
Main effects                                         8
Two-factor interactions                             24
Error (estimated from high-order interactions)      32

When 2 replications are available, both squares should be used. All
8 d.f. for any three-factor interaction may be tested, though each is based
on only a single replication. The calculation of the sums of squares for
the three-factor interactions follows the method outlined for the 3⁴
factorial in blocks of 9 units (section 6.32). Plan 8.6 indicates the
replication from which any component must be taken. Thus, if ABC is being
calculated, we use the data from square II for ABC II and ABC IV and
the data from square I for ABC I and ABC III.

4 × 2² factorial in an 8 × 8 square. Plan 8.7 requires 4 replications.
The symbols A′, A″, A‴ refer to components of the A effects.

A′ = a₃ + a₂ − a₁ − a₀;    A″ = a₃ − a₂ − a₁ + a₀;
A‴ = a₃ − a₂ + a₁ − a₀

No two-factor interaction is confounded.

In view of the small relative information on ABC (only 1/3 on the
average), the sum of squares for this term will usually be combined with
the error. In this event the analysis of variance reads as follows.

                                              d.f.
Rows                                            7
Columns                                         7
Main effects and two-factor interactions       12
Error (by subtraction)                         37

8.4 Other Quasi-latin Squares 

Yates (8.1) gives experimental plans which can be used for the
following factorial combinations: 3² × 2 (6 × 6 square), 4 × 2³ (8 × 8
square), and 9² (9 × 9 square). Analogous to the last design are designs
for 7² (7 × 7 square) and 5² (5 × 5 square). In a general discussion of
methods for constructing such plans, Rao (8.3) presents designs for a 2⁴
factorial in 4 × 4 squares.

For certain factorial combinations of treatments, the lattice squares
(chapter 12) and the incomplete latin squares (chapter 13) confound all
treatment effects to the same extent. These arrangements may be
useful in cases where the experimenter does not wish to sacrifice information
on the interactions.


8.5 Estimation of the Efficiency of Quasi-latin Squares 

In many cases the precision of a quasi-latin square can be compared
with that of some other design which might have been used. The data
from the example in table 8.1 will illustrate two types of comparison.

In this experiment a factorial system applied to plots of soybeans
was arranged in two 4 × 4 squares, with 1/2 relative information on
AB, AC, BC, and ABC. As an alternative, the experiment might have
been laid out in randomized incomplete blocks of 4 plots each, with
complete confounding of ABC (plan 6.1). Since the columns were more
compact in shape than the rows, we will suppose that the columns would
have been chosen as blocks.

We first estimate the error m.s. that would have been obtained with
incomplete blocks. It is helpful to construct analyses of variance for the
2 designs (table 8.3) for the case where there are no treatments. The
mean squares to be inserted in the table are explained in the next
paragraph.

TABLE 8.3 Analyses of variance (applicable to uniformity data)

Quasi-latin square                        Incomplete blocks
                        d.f.    m.s.                          d.f.    m.s.
Columns                   6    18.78      Blocks (columns)      6    18.78
Rows                      6    16.82      Intra-blocks         24   (11.15)
Intra-row-and-column     18     9.26

With incomplete blocks, the degrees of freedom to be divided between
treatments and error are the 24 d.f. within columns (3 from each of the
8 columns). Since the randomization ascribes to treatments a random
selection from these 24 d.f., the mean square for the 24 d.f. provides an
unbiased estimate of the error appropriate to the incomplete block
design. The 24 d.f. subdivide into 6 representing variations among rows,
and 18 representing intra-row-and-column variation. If the interactions
partially confounded with the rows are regarded as negligible, the rows
m.s. (16.82) from the quasi-latin square analysis provides an estimate of
the 6 d.f. The quasi-latin square analysis does not supply information
on the whole of the remaining 18 d.f., since in the experiment 7 of these
were ascribed to treatments. By virtue of the randomization for the
quasi-latin square, however, the error m.s. (9.26), with 11 d.f., may be
taken as an estimate of the mean square for the whole 18 d.f.

Accordingly, an unbiased estimate of the error m.s. for the incomplete
block design is

(18 × 9.26 + 6 × 16.82)/24 = 11.15

For the estimation of main effects, the efficiency of the quasi-latin
square relative to incomplete blocks is estimated as 11.15/9.26, or 120%.
Owing to the confounding in the quasi-latin square, the relative efficiency
on the interactions is only 60%.
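
The arithmetic of this comparison can be laid out as a short Python sketch. It is an illustration only; the function name is hypothetical, and the factor 1/2 for the interactions is the relative information retained on them in the quasi-latin square.

```python
def pooled_mean_square(components):
    """Weighted mean of mean squares; components is a list of (d.f., m.s.)."""
    total_df = sum(df for df, _ in components)
    return sum(df * ms for df, ms in components) / total_df


quasi_latin_error_ms = 9.26   # error m.s. of the quasi-latin square (11 d.f.)
rows_ms = 16.82               # rows m.s. of the quasi-latin square (6 d.f.)

# Intra-block error for incomplete blocks (columns as blocks): 18 d.f. of
# intra-row-and-column variation pooled with 6 d.f. among rows.
incomplete_block_ms = pooled_mean_square(
    [(18, quasi_latin_error_ms), (6, rows_ms)]
)

efficiency_main = incomplete_block_ms / quasi_latin_error_ms
# Only 1/2 relative information is retained on AB, AC, BC, and ABC.
efficiency_interactions = 0.5 * efficiency_main

print(round(incomplete_block_ms, 2),
      round(100 * efficiency_main),
      round(100 * efficiency_interactions))
```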

The quasi-latin square may also be compared with ordinary
randomized blocks of 8 plots, where each replication is a pair of columns.
In table 8.1 the column totals are as follows.

    Rep. I           Rep. II          Rep. III         Rep. IV
246.6  245.9      250.4  233.1      255.8  236.4      248.7  257.8

With uniformity data, the randomized blocks error contains 28 d.f.
(7 from each replicate). To obtain this error, the incomplete blocks
error (24 d.f.) must be pooled with the 4 d.f. which measure differences
between the 2 columns in each pair. Since the differences between the
2 column totals in each replication are 0.7, 17.3, 19.4, and 9.1,
respectively, the sum of squares for the 4 d.f. is

[(0.7)² + (17.3)² + (19.4)² + (9.1)²]/8 = 94.87

This gives [24(11.15) + 94.87]/28 = 12.95 as an estimate of the
randomized blocks error. The relative efficiency of the quasi-latin square is
140% on main effects and 70% on interactions.
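
The same comparison can be sketched in Python. This is an illustrative calculation only, with hypothetical variable names; the column totals are those of table 8.1, and the divisor 8 arises because each difference is between two totals of 4 plots.

```python
# Column totals of table 8.1, taken in pairs (one pair per replication).
column_totals = [246.6, 245.9, 250.4, 233.1, 255.8, 236.4, 248.7, 257.8]
pairs = [column_totals[i:i + 2] for i in range(0, 8, 2)]

# Sum of squares (4 d.f.) between the 2 columns within each replication.
between_columns_ss = sum((first - second) ** 2 for first, second in pairs) / 8

incomplete_block_ms = 11.15   # intra-block error, 24 d.f.
quasi_latin_error_ms = 9.26   # quasi-latin square error

# Randomized blocks error: 24 d.f. pooled with the 4 d.f. between columns.
randomized_blocks_ms = (24 * incomplete_block_ms + between_columns_ss) / 28

efficiency_main = randomized_blocks_ms / quasi_latin_error_ms
efficiency_interactions = 0.5 * efficiency_main

print(round(between_columns_ss, 2), round(randomized_blocks_ms, 2),
      round(100 * efficiency_main), round(100 * efficiency_interactions))
```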

The comparisons above require the assumption that treatment effects
which are confounded with rows or columns are non-existent. By more
complicated procedures, estimates can be made which avoid this
assumption. The relative efficiencies quoted above are slightly too high,
because we neglected the effect of the additional error d.f. that would
be available in the alternative designs (section 2.31).

8.6 Treatments Applied to Complete Rows of a Latin Square 

A simple example (taken from plan 8.8) is shown below. The latin
square treatments form a 2 × 2 factorial for factors B and C. An extra
factor A has been superimposed on the rows. The first 2 rows receive
the lower level of A, and the last 2 rows the higher level.

                  ABC
 a₀     b      c     (1)     bc
 a₀     c      b      bc    (1)
 a₁    (1)     bc     b      c
 a₁     bc    (1)     c      b


This plan might be tried in a case where the extra factor was not
suitable for application to individual units. No precise estimate of the
effect of A would be expected, but some useful information on the
interactions between A and the other factors might be hoped for. However,
this device has one property that necessitates caution in its use.
Whenever an extra factor is superimposed on the rows, certain interactions
between that factor and the latin square treatments are automatically
confounded with columns, as noted by Yates (8.2). In the plan above,
the ABC interaction

(abc) + (a) + (b) + (c) − (ab) − (ac) − (bc) − (1)

is the difference between the first 2 columns and the last 2 columns. On
the other hand, AB and AC are free of column effects. In the plans
presented here, which are due to Yates (8.1), such confounding is confined
as far as possible to high-order interactions. The reader who constructs
his own designs should verify, before using them, what interactions have
been confounded in this way.
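
Such a verification is easily mechanized. The Python sketch below is illustrative only: the function names are hypothetical, and the layout is the 4 × 4 example above as reconstructed (rows 1 and 2 at the lower level of A, rows 3 and 4 at the higher level). An interaction is confounded with columns when its sign is constant within every column, and free of column effects when its signs sum to zero in every column.

```python
# (level of A, latin square treatments in that row), as reconstructed.
ROWS = [
    (0, ["b", "c", "(1)", "bc"]),
    (0, ["c", "b", "bc", "(1)"]),
    (1, ["(1)", "bc", "b", "c"]),
    (1, ["bc", "(1)", "c", "b"]),
]


def with_a(level, treatment):
    """Full treatment combination once the row factor A is superimposed."""
    if level == 0:
        return treatment
    return "a" if treatment == "(1)" else "a" + treatment


def sign(effect, treatment):
    """Sign with which a treatment combination enters a factorial effect."""
    letters = "" if treatment == "(1)" else treatment
    shared = sum(1 for f in effect.lower() if f in letters)
    return 1 if (len(effect) - shared) % 2 == 0 else -1


def column_sums(effect):
    """Sum of the signs of an effect over the 4 plots of each column."""
    return [sum(sign(effect, with_a(level, row[j])) for level, row in ROWS)
            for j in range(4)]


for effect in ("AB", "AC", "ABC"):
    print(effect, column_sums(effect))
```

Column sums of +4 or −4 show complete confounding with columns; sums of zero show that the effect is orthogonal to columns.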

As indicated above, these designs are useful in experiments where (i) 
it is not easy to apply factor A to individual units, and (ii) accurate 
estimates of the interactions of A, but not of its main effects, are wanted. 
For instance, in an experiment on an irrigated crop, we might wish to 
find out how the responses to plant nutrients B and C are affected by the 
level of the water supply. Factor A could then represent a restricted 
and an abundant amount of irrigation water, applied to whole rows of the 
square. 

These half-plaid squares are related both to quasi-latin squares and to
split-plot designs. The difference from quasi-latin squares is that in the
latter the confounding is confined to interactions, whereas main effects
also are confounded in half-plaid squares. The connection with
split-plot designs is seen when the rows are regarded as the experimental
units, and the units (or plots) as sub-units. From this point of view the
plan above may be considered as a split-plot design with the sub-units
arranged in latin squares.

A moderately large number of useful half-plaid squares can be
constructed; only five of the simplest are given here. In the notation
employed, the factorial system for the latin square treatments is placed in
parentheses ( ), while the number of treatments imposed on the rows
is shown before the parentheses. Thus plan 8.8 shows a 2 × (2²)
factorial.

As with an ordinary latin square, rows and columns must both be per- 
muted at random. The whole-row treatments cannot be arranged in 
separate blocks. 

The statistical analysis may be followed from the partition of the de- 
grees of freedom, which is given in condensed form with each plan. As 
in the split-plot design, there are two errors. The error for the factor A 
applied to the rows is derived from the sum of squares among rows minus 
the sum of squares for A . 

Most of the other treatment effects are unconfounded. Their sums of
squares are calculated in the usual way and tested against the latin
square error. In plans 8.9 and 8.10 certain interactions are partially
confounded with columns; the calculations for these effects are described
in the notes below. In the remainder of the plans, some interactions are
completely confounded with rows or columns and consequently do not
appear in the treatments s.s. The error s.s. is found by subtraction.

8.61 Notes on the Plans and Statistical Analysis. 2 × (2²) factorial
in a 4 × 4 square (plan 8.8). At least two squares (4 replications) should
be used.

2 × (3 × 2) factorial in a 6 × 6 square (plan 8.9). Each square
contains 3 replicates. BC and ABC are partially confounded with columns,
the relative information being 8/9 and 5/9, respectively. The sums of
squares for these effects are calculated by the procedure for the 3 × 2²
factorial in incomplete blocks of 6 units, where the columns constitute
the blocks. A numerical example is given by Yates (8.1, p. 58).
Columns Ia ... IIIb are identical, respectively, with Yates' blocks Ia ...
IIIb. Notice that the factor applied to the rows is C.

3 × (3 × 2) factorial in a 6 × 6 square (plan 8.10). Each square
requires 2 replicates, while the 2 squares comprise a balanced set.
Column effects are partially confounded with AB and ABC, the average
relative information being 7/8 and 5/8, respectively. If the columns are
regarded as incomplete blocks, the numerical example in section 6.19
may be followed for the computation of the sums of squares for AB and
ABC.
The statistical analyses for the 2 × (2³) factorial (plan 8.11) and the
2 × (2⁴) factorial (plan 8.12) in 8 × 8 squares are straightforward. In
the latter, the BCDE and ABCDE interactions have been combined
with the error for A.

Other designs. Yates (8.1, p. 79) presents the plans for a 4 × (2³)
factorial in an 8 × 8 square and (8.1, p. 80) for a 3 × (3³) factorial in a
9 × 9 square. Plans can also be constructed for a 2 × (2⁵) system in an
8 × 8 square and for a 3 × (3²) system in a 9 × 9 square.

8.7 Treatments Applied to Complete Rows and Columns of a Latin 

Square 

In these designs one set of treatments A is applied to complete rows
of the latin square and another set B to complete columns. Although
relatively low precision is obtained on the row and column treatments,
the latin square treatments and their more important interactions with
the A and B treatments are subject only to the latin square error. The
number of useful designs appears to be limited.

Two examples are shown in plans 8.13 and 8.14. The latin square
treatments form a 2³ and 2⁴ system, respectively; the additional
treatments are both at 2 levels. No two-factor interactions are confounded.
The rows and columns of each square should be completely randomized.

The subdivision of degrees of freedom is indicated on each plan. In- 
teractions which are confounded with rows and columns must be omitted 
from the treatments s.s.: all other treatment effects are calculated in the 
usual way. In plan 8.14, where the square contains a single replication, 
the only available estimate of error comes from the high-order inter- 
actions, unless two or more squares are used. 

A 9 × 9 square for a 3² system of treatments, with row and column
treatments also at 3 levels each, is given in reference (8.1), p. 81.
Plan 8.1a    2³ factorial in two 4 × 4 quasi-latin squares

Plan 8.1b    2³ factorial in two 4 × 4 latin squares

ABC completely confounded with squares
Plan 8.2    2⁴ factorial in an 8 × 8 quasi-latin square

                      ABCD (columns)
 c      abcd   b      ad     a      bd     abc    cd       ABC
 abd    (1)    bcd    bc     acd    ac     d      ab

 d      bc     a      abcd   b      cd     abd    ac       ABD
 bcd    ad     acd    bd     abc    ab     c      (1)

 a      bd     c      ab     d      abcd   acd    bc       ACD
 abc    ac     abd    cd     bcd    (1)    b      ad

 b      ab     d      ac     c      ad     bcd    abcd     BCD
 acd    cd     abc    (1)    abd    bc     a      bd



Plan 8.3    2⁵ factorial in an 8 × 8 quasi-latin square

      ACE, BCD, ABDE                 ACD, BDE, ABCE
 (1)    abe    bc     ace       abd    acd    bcde   de
 bce    ac     e      ab        bcd    d      abde   acde      ABC, ADE, BCDE
 cde    abcd   bde    ad        abce   ae     b      c
 bd     ade    cd     abcde     be     ce     abc    a

 abc    ce     acde   bcd       (1)    bde    ad     abe
 acd    bcde   abce   c         ade    ab     e      bd        ABD, BCE, ACDE
 abde   d      a      be        cde    bc     ace    abcd
 ae     b      abd    de        ac     abcde  cd     bce


Plan 8.4    2^6 factorial in 8 X 8 quasi-latin squares 

Square V 

Plan 8.5    3^3 factorial in 9 X 9 quasi-latin squares 

Plan 8.5 (Continued)    3^3 factorial in 9 X 9 quasi-latin squares 

Square II 

CHAPTER 9 

BALANCED AND PARTIALLY BALANCED INCOMPLETE 
BLOCK DESIGNS 

9.1 Balanced Designs 

As their name implies, these designs, introduced by Yates (9.1), are 
arranged in blocks or groups that are smaller than a complete replication, 
in order to eliminate heterogeneity to a greater extent than is possible 
with randomized blocks and latin squares. In the designs described 
in chapters 6 to 8, this reduction in the size of block was achieved 
by sacrificing all or part of the information on certain treatment com- 
parisons. The present designs, on the other hand, were developed for 
experiments in plant breeding and selection, where it is desired to make 
all comparisons among pairs of treatments with equal precision. Con- 
sequently, a different method for reducing the size of block is employed. 

The designs may be arranged either in randomized incomplete blocks 
or in quasi-latin squares. They may be balanced or partially balanced. 
The balanced designs will be illustrated first by simple examples of the 
experimental plans. 

Consider the plan in table 9.1 which compares 9 treatments in in- 
complete blocks of 3 experimental units with 4 replications. 

TABLE 9.1 Balanced design for 9 treatments in blocks of 3 units 

Block   Rep. I         Rep. II         Rep. III         Rep. IV 

 (1)   1  2  3    (4)  1  4  7    (7)  1  8  6    (10)  1  5  9 
 (2)   4  5  6    (5)  2  5  8    (8)  4  2  9    (11)  7  2  6 
 (3)   7  8  9    (6)  3  6  9    (9)  7  5  3    (12)  4  8  3 

Every pair of treatments will be found to occur once, and only once, 
in the same block. For instance, treatment 1 occupies the same block 
with treatments 2 and 3 in the first replication, with treatments 4 and 7 
in the second replication, with treatments 6 and 8 in the third replication, 
and with treatments 5 and 9 in the fourth replication. When the 
results are analyzed by the method of least squares, this property, to 
which the adjective balanced is applied, ensures that all pairs of treatments 
are compared with approximately the same precision, even though 
the differences among blocks may be large. This design belongs to the 
group known as balanced lattices, so-called because the plans are conveniently 
written down by drawing a square lattice, with the treatment 
numbers at the intersections of the lines. In the balanced lattices, the 
number of treatments must be an exact square while the number of units 
per block is the corresponding square root. 

Balanced designs can be constructed for other numbers of treatments 
and of units per block. The plan in table 9.2 shows 7 treatments ar- 
ranged in blocks of 3 units. 


TABLE 9.2 Balanced design for 7 treatments in blocks of 3 units 

Block 

 (1)  1  2  4    (3)  3  4  6    (5)  1  5  6    (7)  1  3  7 
 (2)  2  3  5    (4)  4  5  7    (6)  2  6  7 



Again every pair of treatments occurs once within some block. In 
this case, however, the blocks cannot be grouped in separate replications, 
since 7 is not divisible by 3. Designs of this type are known as balanced 
incomplete blocks. 
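
The balance property claimed for tables 9.1 and 9.2 can be checked mechanically. The following is a minimal sketch, not part of the original text: the function names `pair_counts` and `is_balanced` are illustrative, and the blocks are transcribed from the two tables.

```python
from collections import Counter
from itertools import combinations

# Blocks transcribed from tables 9.1 and 9.2.
TABLE_9_1 = [
    (1, 2, 3), (4, 5, 6), (7, 8, 9),   # Rep. I
    (1, 4, 7), (2, 5, 8), (3, 6, 9),   # Rep. II
    (1, 8, 6), (4, 2, 9), (7, 5, 3),   # Rep. III
    (1, 5, 9), (7, 2, 6), (4, 8, 3),   # Rep. IV
]
TABLE_9_2 = [
    (1, 2, 4), (2, 3, 5), (3, 4, 6), (4, 5, 7),
    (1, 5, 6), (2, 6, 7), (1, 3, 7),
]


def pair_counts(blocks):
    """Count how often each unordered pair of treatments shares a block."""
    counts = Counter()
    for block in blocks:
        for pair in combinations(sorted(block), 2):
            counts[pair] += 1
    return counts


def is_balanced(blocks):
    """True if every pair of treatments occurs together equally often."""
    treatments = sorted({t for block in blocks for t in block})
    counts = pair_counts(blocks)
    # Counter returns 0 for pairs that never share a block.
    return len({counts[pair] for pair in combinations(treatments, 2)}) == 1


if __name__ == "__main__":
    print(is_balanced(TABLE_9_1))      # balanced lattice
    print(is_balanced(TABLE_9_2))      # balanced incomplete blocks
    print(is_balanced(TABLE_9_1[:6]))  # first 2 replications only
```

Dropping replications, as in the last call, leaves some pairs that never meet in a block, so the reduced plan is no longer balanced.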

For certain numbers of treatments and units per block, both the types 
above can be laid out in a kind of latin square formation so as to allow 
the elimination of variation arising from two types of grouping. The 
appropriate rearrangement for the first example is shown in table 9.3. 

TABLE 9.3 Balanced design for 9 treatments in 4 lattice squares 

          Rep. I            Rep. II            Rep. III             Rep. IV 

                                   Columns 

Rows   (1) (2) (3)  Rows  (4) (5) (6)  Rows  (7) (8) (9)  Rows  (10) (11) (12) 

 (1)    1   2   3    (4)   1   4   7    (7)   1   6   8   (10)    1    9    5 
 (2)    4   5   6    (5)   2   5   8    (8)   9   2   4   (11)    6    2    7 
 (3)    7   8   9    (6)   3   6   9    (9)   5   7   3   (12)    8    4    3 


It may be verified by inspection that every pair of treatments now 
occurs once in the same row and also once in the same column. All 
comparisons between pairs of treatments are of nearly equal precision. 
This design is known as a lattice square. 

The second example is rewritten somewhat differently. 

TABLE 9.4 Balanced design for 7 treatments in an incomplete latin square 

               Columns (Blocks) 

Rows   (1) (2) (3) (4) (5) (6) (7) 

 (1)    1   2   3   4   5   6   7 
 (2)    2   3   4   5   6   7   1 
 (3)    4   5   6   7   1   2   3 

Every treatment now appears in each of the 3 rows, and every pair of 
treatments appears together once in the same column. Since the plan 
above could represent the first 3 rows of a 7 X 7 latin square, this type 
of design has been called an incomplete latin square, or alternatively a 
Youden square, after W. J. Youden, who developed these designs for 
greenhouse experiments. 
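
The two properties of table 9.4 can be verified in a few lines. This is a sketch only: the helper names are hypothetical, and the observation that each row of the table is a cyclic shift of 1, ..., 7 (starting at 1, 2, and 4) is read off the table rather than stated in the text.

```python
from itertools import combinations


def cyclic_rows(starts, t):
    """Build rows of t treatments, each a cyclic shift beginning at a start value."""
    return [[(s - 1 + c) % t + 1 for c in range(t)] for s in starts]


def check_incomplete_latin_square(rows):
    """Return (rows_complete, set of pair counts over columns)."""
    t = len(rows[0])
    full = set(range(1, t + 1))
    # Every treatment must appear exactly once in each row.
    rows_complete = all(set(row) == full for row in rows)
    # Each column is a block; count how often each pair shares a column.
    counts = {}
    for c in range(t):
        column = sorted(row[c] for row in rows)
        for pair in combinations(column, 2):
            counts[pair] = counts.get(pair, 0) + 1
    pair_values = {counts.get(p, 0) for p in combinations(range(1, t + 1), 2)}
    return rows_complete, pair_values


ROWS = cyclic_rows((1, 2, 4), 7)

if __name__ == "__main__":
    for row in ROWS:
        print(row)
    print(check_incomplete_latin_square(ROWS))
```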


9.2 Partially Balanced Designs 

Although a balanced design can be constructed with any number of 
treatments and any number of units per block, the minimum number of 
replications is fixed by these 2 variables. In most cases this number is 
too large for the usual conditions of experimentation. In order to allow 
more freedom of choice in the number of replicates, designs which lack 
the complete symmetry of the balanced designs must be used. 

The most useful of such designs are the lattices. These are constructed 
in the same way as balanced lattices except that there are fewer replications. 
The design with 2 replications (e.g., the first 2 replications in 
table 9.1) is called a simple lattice, and that with 3 replications a triple 
lattice. Similarly, with a lattice square, as in table 9.3, we may use fewer 
than the full number of replicates necessary for balance. For all these 
designs the number of treatments must be a perfect square. 

A further set, called cubic lattices, is useful when the number of treatments 
is very large. The number of treatments is the cube of the number 
of units per block, so that a drastic reduction in block size is obtained. 
The number of replicates is 3 or some multiple of 3. 
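
As a rough illustration of how the block size falls out of the number of treatments, the sketch below lists the lattice types that fit a given number of treatments. The function names and the dictionary layout are hypothetical; only the rules themselves (perfect square for lattices, perfect cube for cubic lattices) come from the text.

```python
def integer_root(t, power):
    """Return k with k**power == t, or None if t is not an exact power."""
    k = 1
    while k ** power < t:
        k += 1
    return k if k ** power == t else None


def lattice_options(t):
    """List the lattice types available for t treatments with their block size."""
    options = {}
    k_square = integer_root(t, 2)
    if k_square is not None:
        # Simple and triple lattices use 2 and 3 replications respectively.
        options["lattice"] = {"units_per_block": k_square, "replications": (2, 3)}
    k_cube = integer_root(t, 3)
    if k_cube is not None:
        # Cubic lattices need 3 replications or a multiple of 3.
        options["cubic lattice"] = {"units_per_block": k_cube, "replications": (3,)}
    return options


if __name__ == "__main__":
    print(lattice_options(729))   # both a 27-unit and a 9-unit block are possible
    print(lattice_options(1000))  # only the cubic lattice fits
```

With 729 treatments, for instance, the cubic lattice needs blocks of only 9 units where the ordinary lattice needs 27.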

Partially balanced designs are less suitable than balanced designs in a 
number of ways. The statistical analysis is more complicated. It is 
mainly for this reason that we have not included any partially balanced 
designs corresponding to those in tables 9.2 and 9.4. Further, when the 
variation among blocks (or rows and columns) is large, some pairs of 
treatments are more precisely compared than others, so that several 
standard errors are required for tests of significance. Also, the overall 
precision of the experiment is decreased by the lack of symmetry. This 
means that under comparable conditions a partially balanced design 
gives a slightly higher standard error per replication than a balanced de- 
sign. These disadvantages are of minor importance in lattice designs, 
which will be found almost as convenient as balanced designs. 

9.3 Basis of the Statistical Analysis 

9.31 Analysis without Recovery of Inter-block Information. To those 
who have previously used only the simplest designs, one feature of the 
incomplete block designs will be unfamiliar. The treatment averages 
must be adjusted in order to secure the full accuracy available in the de- 
sign. That this is necessary can be seen by examining, for example, the 
balanced lattice in table 9.1. In three of the 4 replications, treatments 
1 and 2 are in different blocks. Consequently, if the simple averages of 
the 2 treatments are compared, the effects of differences among blocks are 
eliminated only for the first replication. 

Yates developed two methods of analysis for the designs. The first 
is an analysis by standard least squares, reference (9.1), sometimes referred 
to as the “intra-block” analysis. Later he showed that comparisons 
among the block totals also contained information about treatment 
effects that can be utilized in the larger experiments. This approach 
is analogous to that in split-plot experiments, where it will be recalled 
that use is made both of comparisons within whole-units (corresponding 
to incomplete blocks), and of comparisons among the totals of 
different whole-units. In this section the original method of analysis 
will be outlined for balanced lattice experiments. This method has not 
been entirely superseded by the newer analysis, because it must be used 
with small experiments. The newer analysis is sketched in section 9.32. 

For the original analysis the mathematical model and the assumptions 
are essentially the same as those for previous designs in this book, as 
described in section 3.2. In the balanced lattice there are $k^2$ treatments 
in blocks of $k$ units, with $(k + 1)$ replications. Let $y_{ijq}$ be the observation 
for the $q$th treatment, which we suppose to be in the $j$th block within 
the $i$th replication. The model is 

$$y_{ijq} = \mu + \pi_i + \beta_{ij} + \tau_q + e_{ijq} \tag{9.1}$$

where $\mu$, $\pi_i$, $\beta_{ij}$, and $\tau_q$ represent the effects of the mean, the replicate, 
the incomplete block, and the treatment, respectively, and $e_{ijq}$ is the 
intra-block residual or error, assumed to be normally and independently 
distributed with mean zero and variance $\sigma_e^2$. As with previous designs, 
we find estimates of the parameters by minimizing 

$$\sum \sum (y_{ijq} - m - p_i - b_{ij} - t_q)^2$$

subject to the usual relations 

$$\sum_i p_i = 0, \qquad \sum_j b_{ij} = 0 \ \text{(for any } i\text{)}, \qquad \sum_q t_q = 0 \tag{9.2}$$

The normal equation for any unknown is obtained as before by equating 
the observed total to the expected total over all units whose equation 
contains the constant. Thus, if $T_q$ denotes the total for all $(k + 1)$ 
units that receive the $q$th treatment, the normal equation for $t_q$ is 

$$T_q = (k + 1)m + \sum b_{ij} + (k + 1)t_q \tag{9.3}$$

the terms in the $p_i$ having disappeared in virtue of equations (9.2), since 
any treatment appears in all replicates. 

This equation does not at once give the value of $t_q$, since it still contains 
the unknowns $m$ and $b_{ij}$. From the normal equation for $m$, it is easy to 
see that $m$ is the average over the whole experiment. The normal equation 
for any $b_{ij}$ is 

$$B_{ij} = km + kp_i + kb_{ij} + \sum t_q \tag{9.4}$$

where $B_{ij}$ is the observed block total, and the treatments sum is over the 
$k$ treatments that are in the block. If we add these equations for all 
blocks that contain the $q$th treatment, we obtain 

$$B_t = k(k + 1)m + k\sum b_{ij} + \sum \sum t_q \tag{9.5}$$

where $B_t$ is the total of all such blocks. 

It is at this point that the structure of the design is important. In 
the blocks that contain $t_q$, every other treatment occurs once and only 
once, while $t_q$ appears $(k + 1)$ times. Hence 

$$\sum \sum t_q = (k + 1)t_q + \sum (\text{all other } t\text{'s}) = kt_q + \sum (\text{all } t\text{'s}) = kt_q$$

from (9.2). 

This simplifies (9.5) to 

$$B_t = k(k + 1)m + k\sum b_{ij} + kt_q \tag{9.6}$$

We may now eliminate the unwanted term $\sum b_{ij}$ from (9.3) and (9.6), 
giving 

$$kT_q - B_t = k^2 t_q \tag{9.7}$$

a relatively simple solution. In effect, we simply adjust the treatment 
total for the total of the blocks in which it appears. 

Since $t_q$ represents the deviation of the estimated treatment mean from 
the mean for all treatments, it is customary in practice to compute 
$(t_q + m)$, which represents the estimated mean itself. A little algebraic 
manipulation from (9.7) shows that 

$$t_q + m = \frac{T_q}{k + 1} + \frac{kT_q - (k + 1)B_t + G}{k^2(k + 1)} \tag{9.8}$$

where $G = k^2(k + 1)m$ is the grand total for the whole experiment. 
Note that $T_q/(k + 1)$ is the simple or unadjusted treatment mean. The 
last term on the right therefore represents the adjustment that is applied 
to the ordinary mean. This result will be used in a later comparison with 
the newer analysis. 
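
The adjustment can be tried out on artificial data. The sketch below is an illustration under stated assumptions, not the author's computing routine: it uses the plan of table 9.1, invents error-free observations from arbitrary (hypothetical) replicate, block, and treatment effects, and checks that the treatment effect $(kT_q - B_t)/k^2$ and the adjusted mean of (9.8) return the effects that were put in.

```python
K = 3  # units per block; the design has K**2 treatments and K + 1 replications

# Blocks of table 9.1, listed replication by replication (K blocks per replication).
DESIGN = [
    (1, 2, 3), (4, 5, 6), (7, 8, 9),
    (1, 4, 7), (2, 5, 8), (3, 6, 9),
    (1, 8, 6), (4, 2, 9), (7, 5, 3),
    (1, 5, 9), (7, 2, 6), (4, 8, 3),
]

# Hypothetical effects, chosen only so that each set sums to zero as in (9.2).
M = 10.0
REP_EFFECTS = [1.0, -1.0, 0.5, -0.5]
BLOCK_EFFECTS = [2.0, -1.0, -1.0, 3.0, 0.0, -3.0, -2.0, 1.0, 1.0, 0.5, 0.5, -1.0]
TREAT_EFFECTS = {q: float(q - 5) for q in range(1, 10)}


def simulate(design):
    """Error-free observations y = m + p_i + b_ij + t_q, keyed by (block index, treatment)."""
    y = {}
    for j, block in enumerate(design):
        rep = j // K
        for q in block:
            y[(j, q)] = M + REP_EFFECTS[rep] + BLOCK_EFFECTS[j] + TREAT_EFFECTS[q]
    return y


def intra_block_estimates(design, y, k):
    """Return {treatment: (t_q, adjusted mean)} from treatment and block totals."""
    block_totals = [sum(y[(j, q)] for q in block) for j, block in enumerate(design)]
    grand_total = sum(block_totals)
    estimates = {}
    for q in sorted({q for block in design for q in block}):
        containing = [j for j, block in enumerate(design) if q in block]
        T_q = sum(y[(j, q)] for j in containing)          # treatment total
        B_t = sum(block_totals[j] for j in containing)    # total of blocks containing q
        t_q = (k * T_q - B_t) / k**2                      # solution of (9.7)
        adjusted = T_q / (k + 1) + (k * T_q - (k + 1) * B_t + grand_total) / (k**2 * (k + 1))  # (9.8)
        estimates[q] = (t_q, adjusted)
    return estimates


Y = simulate(DESIGN)
ESTIMATES = intra_block_estimates(DESIGN, Y, K)

if __name__ == "__main__":
    for q, (t_q, adjusted) in ESTIMATES.items():
        print(q, round(t_q, 6), round(adjusted, 6))
```

Because the artificial data contain no error term, any discrepancy between the estimates and the chosen effects would point to a mistake in the formulas rather than to sampling variation.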

A continuation of the analysis shows that the variance of the difference 
between two $t_q$ values is $2\sigma_e^2/k$. If the experiment had been in randomized 
blocks, with the same number of replications, $(k + 1)$, the corresponding 
variance would have been $2\sigma_r^2/(k + 1)$, where $\sigma_r^2$ is the error 
variance for randomized blocks. Hence the new design, with this method 
of analysis, gives a more accurate experiment than randomized blocks if 
and only if 

$$\frac{\sigma_e^2}{\sigma_r^2} < \frac{k}{k + 1}$$

The quantity $k/(k + 1)$ is called the efficiency factor of the design. Note 
that the incomplete block design would be less accurate than randomized 
blocks if the variation among incomplete blocks were as great as that 
within complete replications. With the newer method of analysis this 
disadvantage is largely removed. 
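
The comparison is easy to tabulate. In this sketch the function names are hypothetical and the variances passed in are illustrative values, not results from any experiment; `sigma_e2` stands for the intra-block error variance and `sigma_r2` for the randomized blocks error variance.

```python
def efficiency_factor(k):
    """Efficiency factor k / (k + 1) of a balanced lattice with k units per block."""
    return k / (k + 1)


def variance_lattice(sigma_e2, k):
    """Variance of a treatment difference, intra-block analysis of a balanced lattice."""
    return 2 * sigma_e2 / k


def variance_randomized_blocks(sigma_r2, k):
    """Variance of a treatment difference with k + 1 replications in randomized blocks."""
    return 2 * sigma_r2 / (k + 1)


def lattice_is_more_accurate(sigma_e2, sigma_r2, k):
    """True when the ratio of error variances falls below the efficiency factor."""
    return sigma_e2 / sigma_r2 < efficiency_factor(k)


if __name__ == "__main__":
    for k in (3, 5, 7, 10):
        print(k, round(efficiency_factor(k), 3))
    # Illustrative variances only: blocking removes 40% of the error variance.
    print(lattice_is_more_accurate(0.6, 1.0, 3))
    # No reduction in error variance: the lattice loses.
    print(lattice_is_more_accurate(1.0, 1.0, 3))
```

The efficiency factor approaches 1 as $k$ grows, so the reduction in error variance needed to break even becomes smaller for the larger lattices.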


9.32 Analysis with Recovery of Inter-block Information. This analysis 
rests on more difficult theory and will be outlined only in part. Formally, 
the mathematical model is exactly the same: 

$$y_{ijq} = \mu + \pi_i + \tau_q + \beta_{ij} + e_{ijq} \tag{9.9}$$

However, the additional assumptions are made that the block effects 
$\beta_{ij}$ are normally and independently distributed with zero means and 
variance $\sigma_b^2$, and that they are independent of the $e_{ijq}$. It follows from 
the assumptions that for the difference between two observations in the 
same block, the residual variance is $2\sigma_e^2$, since the $\beta_{ij}$'s cancel, whereas 
for the difference between two observations in different blocks, the 
residual variance is $2(\sigma_e^2 + \sigma_b^2)$. The reader will note the analogy with 
split-plot experiments (section 7.12). With regard to the justification 
for the additional assumptions, it can be said that in agricultural field experiments, 
for which the designs were first developed, it is usually just 
as reasonable to make the assumptions for the $\beta_{ij}$ as for the $e_{ijq}$. The 
assumptions cannot be taken for granted, and with certain types of data 
we might not wish to make them. 

As a result of the assumptions, observations that are in the same block 
are positively correlated, so that the simplest type of least squares 
estimation cannot be applied. Instead, we may write down the joint 
frequency distribution of all the observations, and use the more general 
method of estimation known as maximum likelihood. By a well-known 
device in theory, maximum likelihood estimation may be shown to be 
equivalent to the minimization of a weighted sum of squares, consisting 
of two parts. The first part is the sum of squares of deviations of the 
residuals from their block means; the second is the sum of squares of the 
residuals of the block totals. The two parts receive weights $w$ and $w'/k$, 
respectively, where 

$$w = \frac{1}{\sigma_e^2}, \qquad w' = \frac{1}{\sigma_e^2 + k\sigma_b^2}$$

For the moment we suppose that $w$ and $w'$ are known. For the balanced 
lattice, the quantity to be minimized is 

$$w \sum (y_{ijq} - \bar{y}_{ij\cdot} - t_q + \bar{t}_{ij\cdot})^2 + \frac{w'}{k} \sum (B_{ij} - km - kp_i - k\bar{t}_{ij\cdot})^2 \tag{9.10}$$

where $\bar{y}_{ij\cdot}$, $\bar{t}_{ij\cdot}$ denote, respectively, the observed mean of a block and the 
mean of the $t$'s that occur in the block. The first sum is over all observations, 
the second over all blocks. 

When we differentiate with respect to a given $t_q$, we must note that the 
term $\bar{t}_{ij\cdot}$ will contain $t_q$, with coefficient $1/k$, whenever the block in question 
contains the $q$th treatment. The derivative, when equated to zero 
(omitting a factor $-2$), gives 

$$w \sum [y_{ijq} - \bar{y}_{ij\cdot} - t_q + \bar{t}_{ij\cdot}] + \frac{w'}{k} \sum [B_{ij} - km - kp_i - k\bar{t}_{ij\cdot}] = 0 \tag{9.11}$$

The first sum is over the observations that receive the $q$th treatment; 
the second over the blocks that contain this treatment. The derivative 
contains an additional part arising from the terms in $\bar{t}_{ij\cdot}$ in the first 
bracket of (9.10), but this contribution will be found to vanish. 

The next step is to simplify (9.11). Both brackets involve a sum of the 
terms $\bar{t}_{ij\cdot}$. In each case, because of the symmetry of the balanced lattice, 
the sum contains all other $t$'s once, and $t_q$ $(k + 1)$ times. Hence 

$$\sum \bar{t}_{ij\cdot} = \frac{(k + 1)t_q + (\text{rest of } t\text{'s})}{k} = \frac{kt_q}{k} = t_q$$

since the total of all $t$'s is zero. Further, we may note that the observed 
block mean $\bar{y}_{ij\cdot}$ equals $B_{ij}/k$. Writing as before $T_q$ for the treatment 
total and $B_t$ for the total of all blocks in which the treatment appears, we 
have from (9.11) 

$$w\left[T_q - \frac{B_t}{k} - (k + 1)t_q + t_q\right] + \frac{w'}{k}\left[B_t - k(k + 1)m - kt_q\right] = 0 \tag{9.12}$$

or 

$$(kw + w')t_q + (k + 1)w'm = wT_q - (w - w')\frac{B_t}{k} \tag{9.13}$$

The constant $m$ may be shown to be the mean for the whole experiment, 
or $G/k^2(k + 1)$. After some rearrangement we may express (9.13) in a 
form comparable with the original estimate given in (9.8), as follows. 

$$t_q + m = \frac{T_q}{k + 1} + \frac{(w - w')[kT_q - (k + 1)B_t + G]}{k(k + 1)(kw + w')} \tag{9.14}$$


The adjustments are seen to be of the same form in the new as in the 
original analysis. The two are equal when $w'$ is zero. This occurs when 
$\sigma_b^2$ is very large; that is, when differences among blocks are great. In 
all other cases the adjustments are smaller with the new than with the 
original analysis. They reduce to zero when $w' = w$, which happens 
only if $\sigma_b^2$ is zero, or in other words if there are no real differences among 
blocks. Thus the new analysis is a generalization of the original analysis, 
reducing to it in the extreme case where there are marked variations 
among incomplete blocks. At the other extreme, when the arrangement 
into blocks has been ineffective, the new analysis makes no adjustments 
for the non-existent block differences, and in fact reduces to an analysis 
by the method for randomized blocks. 
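
The two limiting cases can be checked numerically. The sketch below is an illustration, not the procedure of the text: the function names are hypothetical, the totals are made-up values for a single treatment in a 3 X 3 balanced lattice, and the weights are assumed to be $w = 1/\sigma_e^2$ and $w' = 1/(\sigma_e^2 + k\sigma_b^2)$, treated as known rather than estimated.

```python
def weights(sigma_e2, sigma_b2, k):
    """Assumed weights: w = 1 / sigma_e^2, w' = 1 / (sigma_e^2 + k * sigma_b^2)."""
    return 1.0 / sigma_e2, 1.0 / (sigma_e2 + k * sigma_b2)


def intra_block_mean(T_q, B_t, G, k):
    """Adjusted treatment mean from the original (intra-block) analysis."""
    return T_q / (k + 1) + (k * T_q - (k + 1) * B_t + G) / (k**2 * (k + 1))


def recovered_mean(T_q, B_t, G, k, w, w_prime):
    """Adjusted treatment mean with recovery of inter-block information."""
    unadjusted = T_q / (k + 1)
    adjustment = (w - w_prime) * (k * T_q - (k + 1) * B_t + G) / (
        k * (k + 1) * (k * w + w_prime)
    )
    return unadjusted + adjustment


# Made-up totals for one treatment (k = 3, so 4 replications and 36 units in all).
T_Q, B_T, G_TOTAL, K = 27.5, 118.5, 360.0, 3

if __name__ == "__main__":
    print(intra_block_mean(T_Q, B_T, G_TOTAL, K))             # original analysis
    print(recovered_mean(T_Q, B_T, G_TOTAL, K, 1.0, 0.0))     # w' = 0: same value
    print(recovered_mean(T_Q, B_T, G_TOTAL, K, 1.0, 1.0))     # w' = w: unadjusted mean
    w, w_prime = weights(1.0, 1.0, K)
    print(recovered_mean(T_Q, B_T, G_TOTAL, K, w, w_prime))   # in between
```

For intermediate weights the estimate lies between the unadjusted mean and the fully adjusted one, which is the sense in which the newer adjustments are smaller.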

In practice the weights $w$ and $w'$ are not known. Yates has shown 
that they can be estimated from the analysis of variance, in the same way 
as we estimate two separate errors in a split-plot experiment. The de- 
tails will not be given. The fact that weights are estimated rather than 
exact introduces some additional sampling variation into the adjust- 
ments. This is unimportant in the larger experiments, but in certain 
of the smaller experiments the weights cannot be estimated accurately 
and the original analysis is recommended. For other accounts of the 
analysis by different approaches, see references (9.2), (9.3), (9.4), and 
(9.5). 


9.4 Comparison of Incomplete Block and Randomized Block Designs 

So far as the experimental operations are concerned, incomplete block 
designs are no more difficult than randomized blocks. Some extra 
planning is involved in drawing up and randomizing the experimental 
plan, especially if care is taken to make the best possible grouping of the 
experimental units. According to the computing equipment and ex- 
perience available, the time required for the statistical analysis may ex- 
ceed that for randomized blocks by 20 to 150%. 

The gain in accuracy over randomized blocks depends on the type of 
experimental material and may be expected to increase as the number of 
treatments is increased. Most of the incomplete block designs cover the 
range from 6 to 200 treatments, while the cubic lattices extend this range 
to 1000 treatments without requiring a large incomplete block. The 
number of treatments for which a substantial increase in accuracy is at- 
tained must be determined by experience. If the experimental material 
is highly variable and yet lends itself to the formation of small groups 
which are homogeneous, the designs may be advantageous even with 
small numbers of treatments. From the results of varietal trials a number 
of comparisons with randomized blocks have been made: see, for example, 
references (9.6), (9.7), and (9.8). The experiments varied in size 
from the 3 X 3 to the 11 X 11, and included both lattices and lattice 
squares. They indicated an average gain in accuracy of the order of 
25%. This means that 4 replications of an incomplete block design were 
about as accurate as 5 replications of randomized blocks. 

The ease with which the number of replications can be increased is 
also a factor. The object in the new designs is to obtain the most ac- 
curate comparisons that are possible from a given number of experi- 
mental units. Accordingly, the designs are likely to be most helpful when 
the amount of experimental material or considerations of cost and labor 
force the experiment to be smaller than is desired. Where the number of 
replicates can be increased without difficulty, the experimenter may pre- 
fer some extra replication of a simpler design which avoids the calcula- 
tion of adjustments. 

There is one important property, possessed by many of the designs, 
which increases their attractiveness relative to randomized blocks. As 
the plans show, the lattice square and the lattice designs are arranged 
in complete replications as well as in incomplete blocks. A few of the 
balanced incomplete block designs can also be grouped into replications. 
Such designs may be regarded as randomized block designs which have 
additional restrictions within each replication. It has been shown by 
Yates (9.2) that these designs can be analyzed as if they were ordinary 
randomized blocks. This implies that the unadjusted treatment means give 
unbiased estimates of the true treatment effects, and that the F- and 
t-tests do not lose their validity. Of course, this analysis will in general 
be less accurate than the complete analysis. 

Thus, if there is any criterion for forming incomplete blocks, an in- 
complete block design is worth a trial in preference to a randomized 
block design which occupies the same set of replications. When the data 
have been collected, the experimenter may choose whether to analyze 
them as randomized blocks or to complete the full analysis, with the 
adjustments for incomplete block variations. In fact, if the variation 
among incomplete blocks is no greater than that within blocks, the com- 
plete statistical analysis reduces automatically to that for randomized 
blocks, as mentioned in section 9.32. 

The analysis as randomized blocks may be useful in experiments where 
several measurements are made on each experimental unit. In the formation 
of incomplete blocks the units are usually grouped with regard 
to the most important measurement. For certain other measurements 
from the same experiment, the grouping may be less effective; in such 
cases the “randomized blocks” analysis will be satisfactory, as also with 
measurements of subsidiary interest, where the greatest attainable ac- 
curacy is not required. 

In some types of research an appreciable number of units are likely to 
be injured or destroyed in the course of the experiment, so that they must 
be omitted from the statistical analysis. With incomplete data (except 
where only a few units are missing) laborious computations are required 
to calculate the block (or row and column) adjustments. Consequently, 
in experiments where missing data are of frequent occurrence, incom- 
plete block designs cannot be used to full advantage. Even in this case 
nothing is lost by using an incomplete block design which can be ar- 
ranged in complete replicates. If when the experiment is completed it 
becomes evident that an appreciable number of units must be discarded, 
the experimenter may use the randomized blocks analysis in which the 
extra complication due to missing data is smaller. The same con- 
siderations apply in cases where certain treatments may have to be 
ignored in the final analysis. 

9.5 Comparisons with Other Designs 

Yates (9.1) has discussed three other designs that have been used 
when the number of treatments is large. 

9.51 Systematic Controls. In agricultural field experiments, a control 
variety is sometimes placed at regular intervals over the site, the ex- 
perimental varieties being arranged in complete blocks. From the yields 
of the control plots a fertility index may be calculated for every plot in 
the experiment. These indices are used to adjust the experimental plot 
yields for local variations in fertility. 

The method of systematic controls is very flexible, since it can be used 
with any number of treatments and any number of replicates. Little 
evidence is available about the increase in accuracy obtained from the 
controls, though it seems probable that the increase is seldom large if the 
extra space occupied by the controls is taken into account. The calculation 
of the best adjustments, as described by Yates (9.1), is rather tedious, 
but if a crude type of adjustment is made most of the potential advantages 
of the method may be lost. 

There may be additional reasons for the presence of extra controls, 
e.g., for their use in an observational scoring of the experimental material. 
In this connection it should be noted that extra controls can be 
included in an incomplete block design. 

9.52 Random Controls. In another method the treatments are di- 
vided into groups, the grouping remaining the same in all replicates. 
Each group is arranged in a separate randomized block or latin square 
experiment. In order to obtain comparisons between treatments that 
are in different groups, one or more controls are included in each group 
and randomized along with the other treatments. Before comparing 
treatments that are in different groups, we subtract from each treat- 
ment mean the mean of the controls that are in the same group. Thus 
the controls serve to correct for differences in the fertility of the sites on 
which different groups are tested. 

This design allows considerable flexibility in number of treatments and 
amount of replication and is simple to analyze. From theoretical con- 
siderations it is likely to be inferior in accuracy to a comparable in- 
complete block design, if one exists. Moreover the error variance is not 
the same for all types of comparison. If only one control is included, 
the variance is twice as great for the difference between two treatments 
that are in different groups as for two treatments in the same group. 
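
The factor of two can be checked directly. The lines below are a sketch under an assumption the text does not spell out: every treatment mean and every control mean rests on $r$ independent plots with a common error variance $\sigma^2$.

```latex
% Two treatments A and B in the same group: the controls are not needed.
\operatorname{Var}(\bar{y}_A - \bar{y}_B) = \frac{2\sigma^2}{r}

% A and B in different groups, each corrected by its own single control
% (\bar{c}_1 and \bar{c}_2): four independent means enter the comparison.
\operatorname{Var}\bigl[(\bar{y}_A - \bar{c}_1) - (\bar{y}_B - \bar{c}_2)\bigr]
  = \frac{4\sigma^2}{r}
```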

9.53 Split-plot Designs. This arrangement is a variant of the previous 
design which avoids extra controls. As in section 9.52, the treatments are 
first divided into a number of groups of equal sizes. Instead of testing 
each group on a separate site, the groups are combined into a single experiment 
of the split-plot type. For example, with 25 treatments and 
5 replications, we might first divide the treatments into 5 groups, A, B, 
C, D, and E. The groups are now regarded as whole-plot treatments, 
and could be arranged in a 5 X 5 latin square on plots 5 times as large as 
the basic plots. Within each group the treatments are arranged at random 
on the individual plots. 

From the analysis for a split-plot design (section 7.15), it will be seen 
that we can test the difference between two treatments whether they 
are in the same group (i.e., whole-plot) or in different groups. Controls 
are no longer needed. The advantages and disadvantages of this ar- 
rangement are in general similar to those of the “random controls”: the 
split-plot design is more accurate if the space that would have been al- 
lotted to the controls is utilized. Sometimes the split-plot design is 
really more appropriate than an incomplete block design because the 
treatments divide themselves naturally into groups, and comparisons 
between members of the same group are regarded as more important 
than comparisons between members of different groups. 


9.6 Choice of Incomplete Block Design 

Tables 9.5 (randomized incomplete blocks) and 9.6 (lattice squares 
and incomplete latin squares) form an index to the incomplete block designs 
in this book. Each table is arranged by number of treatments (t) 
and number of units per block (k) and shows the numbers of replications 
for which designs are available, with references to the plans. It is hoped 
that the tables provide a rapid means of locating a suitable design, if one 
has been constructed. 

Where the number of treatments and the size of block are fixed in ad- 
vance by the conditions of the experiment, little choice is available to 
the experimenter. In certain types of research, however, both the num- 
ber of treatments and the size of block (or row and column) can be varied 
to some extent without impairing the experiment. 

Where more than one design appears appropriate, a design which can 
be arranged in separate replicates is preferable to one which cannot, and 
a balanced design is preferable to a partially balanced design. These 
recommendations usually narrow the choice to one or two designs. For 
example, with 25 treatments to be compared in 8 replications, we might 
use (see table 9.5) either a balanced incomplete block design in blocks of 
4 units, or a lattice design (partially balanced) in blocks of 5 units. The 
latter design, though not the former, can be laid out in separate repli- 
cates. 

The relative advantages of lattice and lattice square designs can be 
learned only by experience. From the results of a lattice square experi- 
ment we may estimate what the standard error would have been if 



TABLE 9.5 Designs aeeanged in randomized incomplete blocks 


Num- 
ber of 
treat- 
ments, 
t 

Num- 
ber of 
units 
per 
block, 
k 

Number of 
replications, 

r * i 

Plan 

number | 

Num- 
ber of 
treat- 
ments, 
t 

Num- 
ber of 
units 
per 
block, 
k 

Number of 
replications, 
r * 

Plan 
number { 

p.b.f 

i 

b-t 


p.b.t 

b-t 


4 

2 


3 

11. 1 

16 

4 

2 • • • 

6 

10.2 


3 


3 

11. 


6 


6 

11 .27 

5 : 

2 


i 4 

11,2 


6 


9 

11.28 


3 


G 

11. 


10 


10 

11 .29 


4 


1 4 

11. 

19 

3 


9 

11.30 

0 

2 


5 

11.3 


9 


9 

11 .31 


3 


5 

11.4 


10 


10 

11 .32 


3 


10 

11.5 

20 

4 

2,3 


10.11 

l 

4 


10 

11.6 

21 

3 


10 

11.33 


5 


5 

11. 


5 


5 

11.34 

7 

2 


6 

11. 


7 


10 

11.35 


3 


3 

11.7 

25 

4 


8 

11.36 


4 


4 

11.8 


5 

1 2.*- 

6 

! 10.3 


G 


6 

11. 


9 


9 

11.37 

8 

2 


7 

11,9 

27 

3 

3 


.10. 


4 


7 

11.10 1 

28 

4 


9 

11.38 


7 


7 

11. 


7 


9 

11.39 

9 

2 


8 

11. 

30 

5 

2,3 


10.12 


3 

2,3 

4 

10.1 

31 

6 


6 

11.40 


4 


8 

11.11 


10 


10 

11.41 


5 


in 

11.12 

36 

6 

2,3 


10.7 


6 


8 | 

11.13 

37 

9 


9 

11.42 


8 


! 8 

11, 

41 

5 


10 

11.43 

10 

2 


i 9 i 

11.14 

42 

6 

2,3 


10.13 


3 


1 9 

11.15 

49 

7 

2 * * * 

8 

10.4 


4 


6 

11.16 

56 

7 

2,3 


10.14 


1 5 


1 9 

11.17 

57 

8 


8 

11.44 


: C 


9 

11.18 

G4 

4 

3 


10. 


1 9 


9 

11. 

64 

8 

2 ••• 

9 

10.5 

11 

2 


io ! 

U. 

72 

8 

2,3 


10.15 


5 


5 

11.19 

73 

9 


9 

11.45 


G 


G 1 

11.20 

81 

9 

2 ■* 

10 

10.6 


i 10 


10 

11. 

90 

9 

2,3 


10.16 

12 

3 ! 

2,3 


10.10 ! 

91 

10 


10 

11.40 

13 

1 3 


6 

11.21 

100 

10 

2,3 


10,8 


4 


4 

11.22 ! 

121 

11 

2 ••• 


10.(12,7) 


9 


9 

11,23 

125 

5 

3 


10, 

15 

3 


7 I 

11.24 | 

144 

12 

2, 3,4 


10,9 


7 


7 

11.26 

169 

13 

2 — 


10.(12.8) 


8 


8 1 

11.26 







* Or any multiple of this number.
† p.b. = partially balanced designs; b. = balanced designs.
‡ References 10., 11. denote chapters.

TABLE 9.6 Designs arranged in lattice squares and incomplete latin squares


Num- 
her of 
treat- 
ments, 
i 

Num- 
ber of 
units 
per 
col. 
and 
row 

Number of 
replications, 
r* 

| 

1 

Plan 

number 

' 

Num- 
ber of 
treat- 
ments, 
t 

Num- 
ber of 
units 
per 
eol. 
and 

row 

Number of 
replications, 
r * 

Plan 

number 

p.b.t 

b.t 

p.b.t 

! b.t 

3 

3, 5 

5 


13.16 

11 

11, 5 


i 

5 

13.3 


3, 7 

7 


13.17 


11, G 


6 

13.4 


! 3,8 

i 8 


13.18 


11, 10 


10 

13. J 


3, 10 

10 


13.19 

13 

13, 4 


4 

13.5 

4 

4,8 


(3)5,6 

13. t 

! 

13,9 


9 

13.6 


4, 5 

5 


; 13.20 

15 

15,7 


7 

13.7 


4,7 

7 


; 13.21 


15,8 


8 

13.8 


4,9 

9 

i 

; 13.22 

16 

4,4 


5 i 

12.2 

5 

5,4 


4 

134 


16 ,6 


6 ! 

13.9 


5, G 

6 ! 


13.23 


16, 10 


10 

13.10 


3, 9 

» 1 


13.24 

1 19 

1 19,9 


0 

13.11 

6 

6, 5 

! 

5 

13.f 

i 

1 

19, 10 


10 

13.12 


6,7 

7 | 


13,25 

! 31 

21, 5 

1 

5 

13.13 

7 

7,3 


3 

13.1 

25 

5,5 


3 

12.3 


7,4 

i 

4 

13.2 

j 31 . 

31,6 


G 1 

13.4 


7,6 


6 

134 

i 37 

37, 9 


9 i 

13.15 


7,8 

8 


13.26 

i 49 

7,7 

3 

4 

12,4 

8 

7,S 


7 

134 

1 04 

8,8 

3 . . . 

9 

12.5 

9 

3,3 


! (2)5,4 

12.1 

; si 

9,9 

3,4 

5 

12.6 

9 

9,8 


8 

! 134- 

i 121 

11, 11 

3 • 

(> 

12.7 

10 

10,0 

i 


9 

134 

1(S9 

13, 13 

3* • ♦ 

7 

12.8 


* Or any multiple of this number.
† p.b. = partially balanced designs; b. = balanced designs.
‡ Constructed from a t × t latin square by omission of last column.
§ ( ) Not enough degrees of freedom for error.

either of the two types of grouping had not been used. Consequently the 
experimenter may test additional methods of grouping by using lattice 
squares. If the extra grouping turns out to be ineffective, the accuracy is 
only slightly less than that of the corresponding lattice design. The 
statistical computations are, however, more laborious for lattice squares, 
since correction terms must be obtained for both row and column effects.
In general, the lattice squares may be expected to be successful in types
of experimentation where the latin square has been found superior to
randomized blocks.

CHAPTER 10


LATTICE AND CUBIC LATTICE DESIGNS 
10.1 Balanced Lattices 

10.11 Description. The number of treatments must be an exact
square. The size of block is the square root of this number. Incomplete
blocks are combined in groups to form separate replications. The special
feature of the balanced lattice, as distinguished from other lattices, is
that every pair of treatments occurs once in the same incomplete block.
Consequently, all pairs of treatments are compared with the same degree
of precision.

The numbers of replications are rather severely restricted, as well as
the numbers of treatments. The useful plans are indexed below.

Balanced lattices contained in plans 10.1-10.6

Number of treatments     9   16   25   49   64   81
Units per block          3    4    5    7    8    9
Replicates               4    5    6    8    9   10

Balanced lattice designs cannot be constructed for 36 treatments, and none
has been found for 100 or 144 treatments.
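The defining property of the balanced lattice can be checked mechanically. The short Python sketch below is illustrative only: the block list is copied from the plan of the 3 × 3 example in table 10.1, and the function names are hypothetical. It counts how often each pair of treatments shares an incomplete block.

```python
from collections import Counter
from itertools import combinations

# Blocks of the 3 x 3 balanced lattice as laid out in table 10.1
# (4 replicates, 3 blocks per replicate, 3 treatments per block).
BLOCKS = [
    (1, 2, 3), (4, 5, 6), (7, 8, 9),   # replicate I
    (1, 4, 7), (2, 5, 8), (3, 6, 9),   # replicate II
    (1, 5, 9), (2, 6, 7), (3, 4, 8),   # replicate III
    (1, 6, 8), (2, 4, 9), (3, 5, 7),   # replicate IV
]


def pair_counts(blocks):
    """Count how often each unordered pair of treatments shares a block."""
    counts = Counter()
    for block in blocks:
        for pair in combinations(sorted(block), 2):
            counts[pair] += 1
    return counts


def is_balanced(blocks, treatments):
    """True when every pair of treatments occurs together exactly once."""
    counts = pair_counts(blocks)
    return all(counts[pair] == 1 for pair in combinations(sorted(treatments), 2))


print(is_balanced(BLOCKS, range(1, 10)))      # all four replicates
print(is_balanced(BLOCKS[:6], range(1, 10)))  # first two replicates only
```

With all four replicates every one of the 36 pairs occurs once; with only the first two replicates some pairs never meet, so the check fails.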

10.12 Statistical Analysis. The analysis is comparatively easy. It
will be illustrated by an experiment on the effects of 9 feeding treatments
on the growth rates of pigs, conducted by the North Carolina Agricultural
Experiment Station. The analysis has been described in more detail
by Comstock et al. (10.1), and the nutritional information obtained
from the results by Peterson et al. (10.2).

For pigs of a given breed, previous experience indicated that a considerable
part of the variance in growth rate between animals can be
ascribed to the litter. Hence the experiment was planned so that litter
differences would not contribute to the intra-block error. The pigs were
divided into sets of 3 litter-mates. Two sets of 3 were assigned to each
block. Within a block, each treatment received one member of each set.
Thus the experimental unit was composed of 2 pigs, each feeding in the
same pen. The plan and growth rates are given in table 10.1; the plan
has been rearranged so that it follows the same pattern as plan 10.1.


TABLE 10.1 Gains in weight (pounds per day) for a total of 2 pigs

Block   Rep. I                                  Totals
(1)     (1) 2.20    (2) 1.84    (3) 2.18          6.22
(2)     (4) 2.05    (5) 0.85    (6) 1.86          4.76
(3)     (7) 0.73    (8) 1.60    (9) 1.76          4.09
                                                 15.07

Block   Rep. II                                 Totals
(4)     (1) 1.19    (4) 1.20    (7) 1.15          3.54
(5)     (2) 2.26    (5) 1.07    (8) 1.45          4.78
(6)     (3) 2.12    (6) 2.03    (9) 1.63          5.78
                                                 14.10

Block   Rep. III                                Totals
(7)     (1) 1.81    (5) 1.16    (9) 1.11          4.08
(8)     (2) 1.76    (6) 2.16    (7) 1.80          5.72
(9)     (3) 1.71    (4) 1.57    (8) 1.13          4.41
                                                 14.21

Block   Rep. IV                                 Totals
(10)    (1) 1.77    (6) 1.57    (8) 1.43          4.77
(11)    (2) 1.50    (4) 1.60    (9) 1.42          4.52
(12)    (3) 2.04    (5) 0.93    (7) 1.78          4.75
                                                 14.04

Treatment totals and adjustment factors

Treatment       T        B_t     W = (3T - 4B_t + G)    Adjusted total    Mean per
                                                           T + μW           unit
    1          6.97     18.61         +3.89                 7.21            1.80
    2          7.36     21.24         -5.46                 7.02            1.76
    3          8.05     21.16         -3.07                 7.86            1.96
    4          6.42     17.23         +7.76                 6.91            1.73
    5          4.01     18.37         -4.03                 3.76            0.94
    6          7.62     21.03         -3.84                 7.38            1.84
    7          5.46     18.10         +1.40                 5.55            1.39
    8          5.61     18.05         +2.05                 5.74            1.44
    9          5.92     18.47         +1.30                 6.00            1.50

           G = 57.42   172.26          0.00                57.43



The steps in the analysis are as follows. The algebraic formulae refer
to a k × k lattice in blocks of k units, with r = (k + 1) replicates.

1. Calculate the block totals, the replication totals, the grand total G,
and the treatment totals T, shown under the plan.

2. For each treatment, calculate the total $B_t$ for all blocks in which
the treatment appears. For treatment 4, this is

$$B_t = 4.76 + 3.54 + 4.41 + 4.52 = 17.23$$

As a check, the total of the $B_t$ values should be k times the total of the T
values.

3. Compute the quantities

$$W = kT - (k + 1)B_t + G$$

whose sum should be exactly zero.

4. The analysis of variance is now obtained. The total s.s. and the
sums of squares for replications and treatments are found in the usual
way. The sum of squares for blocks within replications, adjusted for
treatment effects, is

$$\frac{\sum W^2}{k^3(k + 1)} = \frac{(3.89)^2 + (5.46)^2 + \cdots + (1.30)^2}{108} = 1.4206$$

5. Calculate the adjustment factor

$$\mu = \frac{E_b - E_e}{k^2 E_b} = \frac{0.1776 - 0.0773}{9 \times 0.1776} = 0.0628$$

where $E_b$ and $E_e$ are the blocks and intra-block m.s., respectively. The
adjusted treatment total is $(T + \mu W)$, as shown in table 10.1. To avoid
confusion, the adjusted means are shown per unit (total of 2 pigs) although
a mean per pig would be more natural. If $E_b$ is less than $E_e$, $\mu$
is taken as zero, and no adjustments are applied to the treatment totals.


TABLE 10.2 Analysis of variance for total growth rate of 2 pigs

                              d.f.
                       General             k = 3     s.s.      m.s.
Replications           k                     3      0.0774
Treatments             (k^2 - 1)             8      3.2261
Blocks (adj.)          (k^2 - 1)             8      1.4206    0.1776
Intra-block error      (k - 1)(k^2 - 1)     16      1.2368    0.0773
Total                  (k^3 + k^2 - 1)      35      5.9609

6. For t-tests, calculate the effective error m.s.

$$E_e' = E_e(1 + k\mu) = 0.0773(1 + 3 \times 0.0628) = 0.0919$$

The purpose of the adjustment factor is to increase $E_e$ so as to take
account of sampling errors in the block correction values $\mu W$. The ordinary
rules for the calculation of t-tests may now be applied to $E_e'$. Thus
the variance of the difference between two adjusted treatment totals is
$2rE_e'$, while that for the difference between two adjusted treatment
means is $2E_e'/r$, or 0.0460. The standard error of this difference is 0.214.
If means per pig were taken, the standard error would be 0.107.

7. The treatments m.s. in table 10.2 cannot be tested against the
intra-block error m.s., since the former contains some block effects. For
an approximate F-test, calculate the sum of squares of deviations of the
adjusted treatment totals, which comes to 12.6771. This is divided by
r or 4 to bring to a single-unit basis, and by $(k^2 - 1)$ or 8 to obtain the
mean square, 0.3962. This is tested against the effective error m.s.,
0.0919. The F-ratio, 0.3962/0.0919, or 4.31, has 8 and 16 d.f.

8. In order to estimate the precision relative to randomized blocks,
pool the mean squares for blocks (adjusted) and the intra-block error.
The result is 0.1107, with 24 d.f., and is an unbiased estimate of the error
variance that would have been present if the experiment had been arranged
in randomized blocks. This figure is compared with the effective
error m.s., 0.0919. The relative precision is 0.1107/0.0919, or 120%.
This means that 4 replications of the balanced lattice appear to have
been slightly less accurate than 5 replicates of randomized blocks.
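Steps 1 to 8 can be sketched in a few lines of Python. This is an illustrative sketch, not part of the original computing instructions: the data layout and all names (`DATA`, `analyse_balanced_lattice`, the dictionary keys) are assumptions made for the example, and the yields are those of table 10.1.

```python
# Table 10.1 as (treatment, gain) pairs: 4 replicates x 3 blocks x 3 units.
DATA = [
    [[(1, 2.20), (2, 1.84), (3, 2.18)],
     [(4, 2.05), (5, 0.85), (6, 1.86)],
     [(7, 0.73), (8, 1.60), (9, 1.76)]],
    [[(1, 1.19), (4, 1.20), (7, 1.15)],
     [(2, 2.26), (5, 1.07), (8, 1.45)],
     [(3, 2.12), (6, 2.03), (9, 1.63)]],
    [[(1, 1.81), (5, 1.16), (9, 1.11)],
     [(2, 1.76), (6, 2.16), (7, 1.80)],
     [(3, 1.71), (4, 1.57), (8, 1.13)]],
    [[(1, 1.77), (6, 1.57), (8, 1.43)],
     [(2, 1.50), (4, 1.60), (9, 1.42)],
     [(3, 2.04), (5, 0.93), (7, 1.78)]],
]


def analyse_balanced_lattice(data):
    """Steps 1-8 for a k x k balanced lattice with r = k + 1 replicates."""
    r, k = len(data), len(data[0])
    t = k * k
    gains = [y for rep in data for block in rep for _, y in block]
    G = sum(gains)
    cf = G * G / (r * t)                       # correction for the mean

    T = dict.fromkeys(range(1, t + 1), 0.0)    # treatment totals
    Bt = dict.fromkeys(range(1, t + 1), 0.0)   # totals of blocks holding each treatment
    rep_totals = []
    for rep in data:
        rep_totals.append(sum(y for block in rep for _, y in block))
        for block in rep:
            block_total = sum(y for _, y in block)
            for trt, y in block:
                T[trt] += y
                Bt[trt] += block_total

    W = {i: k * T[i] - (k + 1) * Bt[i] + G for i in T}             # step 3

    ss_total = sum(y * y for y in gains) - cf                       # step 4
    ss_reps = sum(R * R for R in rep_totals) / t - cf
    ss_treat = sum(v * v for v in T.values()) / r - cf
    ss_blocks = sum(w * w for w in W.values()) / (k ** 3 * (k + 1))
    ss_error = ss_total - ss_reps - ss_treat - ss_blocks
    Eb = ss_blocks / (k * k - 1)
    Ee = ss_error / ((k - 1) * (k * k - 1))

    mu = (Eb - Ee) / (k * k * Eb) if Eb > Ee else 0.0               # step 5
    adjusted = {i: T[i] + mu * W[i] for i in T}
    Ee_eff = Ee * (1 + k * mu)                                      # step 6

    mean_adj = sum(adjusted.values()) / t                           # step 7
    ms_adj = sum((a - mean_adj) ** 2 for a in adjusted.values()) / (r * (k * k - 1))

    rb_error = (ss_blocks + ss_error) / ((r - 1) * (k * k - 1))     # step 8
    return {
        "W": W, "ss_blocks": ss_blocks, "Eb": Eb, "Ee": Ee, "mu": mu,
        "adjusted": adjusted, "Ee_eff": Ee_eff,
        "F": ms_adj / Ee_eff, "relative_precision": rb_error / Ee_eff,
    }


result = analyse_balanced_lattice(DATA)
for key in ("ss_blocks", "mu", "Ee_eff", "F", "relative_precision"):
    print(key, round(result[key], 4))
```

Because the sketch carries full floating-point precision rather than the rounded intermediate values of the hand computation, its results agree with the figures in the text only to about the third significant digit.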

These designs may be used in factorial experiments where it is desired
to avoid any sacrifice of replication on the interactions relative to that
on the main effects. Some of the combinations of levels that may be
tested in balanced lattices are: 3 × 3, 8 × 2, 4 × 4, 4 × 2 × 2, $2^4$,
5 × 5, 7 × 7, 8 × 8, $4^3$, 9 × 9, $3^4$. In analyzing a factorial experiment,
first carry through the procedure described in this section. The sum of
squares for deviations of the adjusted treatment totals is then divided
into the sums of squares for main effects and interactions in the usual
way; a divisor r must be introduced in order to convert these sums of
squares to a single-unit basis. The resulting mean squares are tested
against the effective error m.s., $E_e(1 + k\mu)$, which has $(k - 1)(k^2 - 1)$,
or in this case, 16 d.f.

10.13 Missing Data. Since lattice designs are often used in large ex-
periments, involving perhaps several hundred observations, it is not easy 
to ensure that all the observations are accurately made. Even with care- 
ful management of the experiment, there is always a chance that mistakes 
or accidents will affect a few of the observations. Consequently, missing 
data tend to be more common with lattices than with small experiments. 

Methods for the analysis of the results of lattice experiments with incomplete
data have been developed by Cornish (10.3). As might be expected,
the computations are lengthy. They become simpler in two
special cases. The first arises when the incomplete blocks are ineffective,
so that the analysis reduces to a “randomized blocks” analysis. In this
event values are substituted for the missing observations by the formula
which applies to randomized blocks. The second case occurs at the other
extreme when the variation among blocks is so large that the inter-block
information is negligible. Here the correct procedure is to insert values
for the missing observations by minimizing the intra-block error m.s.

Since the general procedure must reduce to these two special pro- 
cedures in the appropriate cases, it might be anticipated, as Cornish’s 
solution shows, that both the “randomized blocks” and the “intra- 
block” estimates for the missing observations are required. Thus we 
have to find two estimates for each missing value. Similarly, two anal- 
yses of variance are necessary, one to obtain the correct value for the 
intra-block error m.s. and one for obtaining the mean square among 
blocks. 

In an attempt to reduce the amount of arithmetic, some investigation 
has been made of the consequences of using a single estimate and a single 
analysis of variance. For this purpose the “intra-block” estimate seems 
the better of the two. It gives the correct intra-block error m.s.; the
block m.s. is in general slightly too high. It provides an excellent ap- 
proximation when the blocks are effective and is at its worst when blocks 
are ineffective; that is, when a “randomized blocks” analysis should have 
been used. In the latter case it still gives unbiased estimates of the 
treatment means, but they are not as accurate as the estimates obtained 
by the use of the “randomized blocks” formula. The chief defect of the 
“randomized blocks” formula is that when block variation is large it 
tends to give an overestimate of the intra-block error m.s., and some- 
times the bias is substantial. For this reason the “randomized blocks” 
estimate is considered more hazardous for general use, despite its greater 
simplicity.

The procedure that is given for the incomplete block designs in this 
and succeeding chapters is to insert values for the missing observations 
by means of the “intra-block” formula. Thereafter the analysis is con- 
ducted in the usual way for complete data, except that in the analysis
of variance 1 d.f. is subtracted from the intra-block error for each missing
observation. This method, or the more accurate Cornish method,
should be used whenever it is intended to recover inter-block information. 
If many observations are missing, or if it is evident on inspection that 
blocks are relatively ineffective, the experimenter may decide at the start 
to analyze the data as a randomized block experiment. In this case sub- 
stitutes for the missing values should of course be obtained by the 
formula for randomized blocks (section 4.25).

The “intra-block” formula for the estimate of a missing observation
in a k × k balanced lattice with (k + 1) replications is as follows.

$$x = \frac{k^2 T + k(k + 1)B - R + G - kT_b - kB_t}{k(k - 1)^2} \qquad (10.1)$$

In this formula, T, B, and R denote as usual the totals for the treatment,
block, and replication which contain the missing observation, while G is
the grand total. Also

$T_b$ = total (over all replications) of all treatments that appear in
the block which has the missing observation.

$B_t$ = total of all blocks in which the treatment with the missing
observation appears.


Example. Suppose that the observation 2.20 for treatment 1 in replicate
1 of table 10.1 had been missing. Form the block, replicate, treatment,
and grand totals, just as in the ordinary analysis. It is helpful to
insert an x for the missing observation and to include this x in all totals
where it should appear. When the value of x has been found, it can then
be inserted in all the appropriate places and the data are ready for computing
the analysis of variance. The list of treatment totals appears as
shown below.

Treatment        T
    1         4.77 + x
    2         7.36
    3         8.05
    4         6.42
    5         4.01
    6         7.62
    7         5.46
    8         5.61
    9         5.92

          G = 55.22 + x

The quantities needed are

$$T = 4.77, \quad B = 4.02, \quad R = 12.87, \quad G = 55.22$$
$$T_b = 4.77 + 7.36 + 8.05 = 20.18$$
$$B_t = 4.02 + 3.54 + 4.08 + 4.77 = 16.41$$

Hence

$$x = \frac{(9)(4.77) + (12)(4.02) - 12.87 + 55.22 - (3)(20.18) - (3)(16.41)}{12} = 1.98$$



The analysis of variance is now computed in the usual way, except
that the total degrees of freedom are reduced from 35 to 34 and the
intra-block error degrees of freedom from 16 to 15. The same adjustment
factor $\mu$ may be used (there is actually a slight change which can
be ignored).
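Formula (10.1) is easily mechanized. The following Python sketch is illustrative only; the function name and argument names are assumptions, and the formula is coded as it is used in the worked example above.

```python
def intra_block_estimate(k, T, B, R, G, Tb, Bt):
    """Intra-block estimate of one missing value in a k x k balanced lattice.

    T, B, R : totals of the treatment, block, and replicate that contain the
              missing observation (computed without it)
    G       : grand total of the observations present
    Tb      : total, over all replicates, of the treatments in the affected block
    Bt      : total of all blocks in which the affected treatment appears
    """
    numerator = k * k * T + k * (k + 1) * B - R + G - k * Tb - k * Bt
    return numerator / (k * (k - 1) ** 2)


# Worked example: observation for treatment 1 in replicate 1 of table 10.1 missing.
x = intra_block_estimate(k=3, T=4.77, B=4.02, R=12.87, G=55.22, Tb=20.18, Bt=16.41)
print(round(x, 2))  # 1.98
```

When several observations are missing, the same function can be called repeatedly inside a loop of successive approximations, each pass treating the other missing values as known.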

When several observations are missing, the values are inserted by the
method of successive approximation (illustrated in section 4.25). The
exact formulae for t-tests are complicated. The following approximate
rule assigns an effective number of replicates to each of two treatments
A and B whose means are being compared. In any replicate of the experiment,
A is credited with 0 replication if it is absent; with ½ replication
if A is present but B occurs in the same block and is absent; and with
1 replication otherwise. The same rule is applied to B. For instance, in
a t-test of the difference between treatments (1) and (3) in table 10.1,
treatment (1) is credited with 3 replications and treatment (3) with 3½
replications (it loses ½ replication because in the first replicate of the
experiment treatment (1) is in the same block and is missing). On the
other hand, in comparing treatment (1) with treatment (4), the effective
numbers of replications are 3 and 4, respectively.
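The crediting rule can be written as a small function. The Python sketch below is an illustration under stated assumptions: it takes the partial credit to be half a replication, as implied by the effective numbers 3 and 4 in the worked comparison, and it uses the plan of table 10.1 with hypothetical names throughout.

```python
# Plan of table 10.1: one list of blocks per replicate.
REPLICATES = [
    [(1, 2, 3), (4, 5, 6), (7, 8, 9)],
    [(1, 4, 7), (2, 5, 8), (3, 6, 9)],
    [(1, 5, 9), (2, 6, 7), (3, 4, 8)],
    [(1, 6, 8), (2, 4, 9), (3, 5, 7)],
]


def effective_replicates(replicates, a, b, missing):
    """Effective replication credited to treatment a when compared with b.

    missing is a set of (replicate_index, treatment) pairs whose
    observations were lost.
    """
    credit = 0.0
    for index, blocks in enumerate(replicates):
        if (index, a) in missing:
            continue                      # a is absent: no credit
        block_of_a = next(block for block in blocks if a in block)
        if b in block_of_a and (index, b) in missing:
            credit += 0.5                 # b shares the block and is absent
        else:
            credit += 1.0
    return credit


missing = {(0, 1)}  # treatment 1 lost in the first replicate
print(effective_replicates(REPLICATES, 1, 3, missing),
      effective_replicates(REPLICATES, 3, 1, missing))
print(effective_replicates(REPLICATES, 1, 4, missing),
      effective_replicates(REPLICATES, 4, 1, missing))
```

Treatment (4) keeps its full 4 replications because it never shares a block with treatment (1) in the replicate where the observation was lost.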

10.2 Partially Balanced Lattices 

10.21 Simple Lattices (Two Replicates). For an experiment with 2 
replicates, use the first 2 replicates of the appropriate set shown below 
(plans 10.1-10.6). Notice that designs are available for 36, 100, and 
144 treatments (plans 10.7-10.9) as well as for those numbers of treat- 
ments for which balanced designs are given. Plans for 121 and 169 
treatments can be taken from plans 12.7 and 12.8. 

The asymmetry of the designs is apparent from the plans; thus, with 
9 treatments (plan 10.1) the first treatment appears in the same block 
as treatments (2), (3), (4), and (7), but not in the same block as any of 
the remaining treatments. 
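The structure of the simple lattice can be illustrated with a short Python sketch. It assumes, as in tables 10.1 and 10.3, that the treatments are numbered row by row in a k × k square, that the first replicate takes the rows as blocks, and that the second takes the columns; the function names are hypothetical.

```python
def simple_lattice(k):
    """First two replicates of a k x k lattice: rows, then columns, of the square."""
    square = [[row * k + col + 1 for col in range(k)] for row in range(k)]
    rows_as_blocks = [tuple(row) for row in square]
    cols_as_blocks = [tuple(square[row][col] for row in range(k)) for col in range(k)]
    return [rows_as_blocks, cols_as_blocks]


def block_mates(replicates, treatment):
    """Treatments that share at least one block with the given treatment."""
    mates = set()
    for blocks in replicates:
        for block in blocks:
            if treatment in block:
                mates.update(block)
    mates.discard(treatment)
    return sorted(mates)


print(block_mates(simple_lattice(3), 1))  # [2, 3, 4, 7]
```

Each treatment meets only 2(k - 1) of the other k² - 1 treatments within a block, which is the asymmetry referred to above.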

For 9 and 16 treatments, simple lattices are unlikely to be more ac- 
curate than randomized blocks unless the variation among incomplete 
blocks is great compared with that within incomplete blocks. Further, 
the numbers of degrees of freedom for estimating the error are only 4 and
9, as against 8 and 15, respectively, for randomized blocks.

10.22 Triple Lattices (Three Replicates). The first 3 replicates of
plans 10.1-10.9, 12.7, and 12.8 are used. Designs may be obtained for
all squares from 9 to 169. With 9 treatments the precaution mentioned
in section 10.21 applies also to the triple lattice.

10.23 Four Replicates. These designs may be obtained either (i) by
duplicating the simple lattice or (ii) by using a quadruple lattice, i.e.,
the first 4 replicates of the plans. With 36 or 100 treatments, only the
first method can be used, since no quadruple lattice exists. The second
procedure is slightly preferable, because the resulting design comes
closer to symmetry, but the statistical analysis requires more time.

10.24 Five Replicates. The first 5 replicates from the balanced set
are taken. Designs are not included for 9, 36, 100, or 144 treatments.

10.25 Higher Numbers of Replicates. Balanced designs should be 
used for the following numbers of treatments and replicates: 25, 6; 49, 
8; 64, 9; and 81, 10. For 9 treatments in 8 or 12 replicates, 16 treatments 
in 10 replicates, and 25 treatments in 12 replicates, the plan for the bal- 
anced design should be repeated. 

In other cases the following recommendations are made. 

Six replicates. Use the triple lattice twice.

Eight replicates. Use the simple lattice four times or the quadruple
lattice twice.

Nine replicates. Use the triple lattice three times.

Ten replicates. Use the simple lattice five times or the quintuple lattice
twice.

The designs recommended are not always as fully balanced as they 
might be made. For instance with 6 replicates of 16 treatments the bal- 
anced design (5 replicates) plus 1 extra replicate gives a more nearly 
symmetrical arrangement than the triple lattice used twice. Since 
separate computing instructions would be required for such plans, we 
have preferred to adhere to a more uniform system. For this reason no 
account is given of designs with 7 replicates, though their statistical 
analysis presents no great difficulty to the reader who has mastered the 
principles. 

10.26 Arrangement of Experimental Material. In the arrangement of
a group of experimental material so as to apply one of these designs, the
most important criterion is that units within the same incomplete block
be homogeneous. In field trials, for example, if the plots are oblong the
usual procedure is to have the incomplete blocks as nearly square as
practicable, the plots extending the whole length of one side of the block.
Uniformity trial investigations have shown that this layout gives on the
average the most homogeneous block. In cases where there is detailed
knowledge of the experimental site, some other method of grouping may
be superior. Efforts should be made to keep the experimental technique
uniform for all units in the same block. Changes in technique that are
necessary should be made when changing from one block to another.

When the blocks have been formed, there is a secondary advantage in 
forming replications so that blocks within the same replicate are as sim- 
ilar to one another as the material permits, since this increases the pre- 
cision of inter-block comparisons. If the full statistical analysis is carried 
out, it is more important to have the incomplete blocks homogeneous 
than to have the replications homogeneous. Thus, when the grouping 
of blocks into compact replications is troublesome, this criterion may 
be ignored without much loss of precision. 

On the other hand, it is desirable to have homogeneous replications if 
the data may subsequently be analyzed as a randomized block design. 
As we have pointed out, the randomized block analysis may be needed 
in fields of work where certain experimental units or whole treatments
are likely to be destroyed during the course of the experiment (e.g., in
pasture plots subject to winterkill).

10.27 Randomization. The randomization consists of three steps. 

1. Randomize the blocks separately and independently within each 
replication. 

2. Randomize the treatments separately and independently within 
each block. 

3. Allot the treatments to the treatment numbers at random. 

For methods of randomization, see chapter 15. 

Steps 1 and 2 give each treatment an equal chance of being allotted to 
any experimental unit. These steps correspond to the allotment of 
treatments to units at random in an ordinary randomized block design. 

The function of step 3 is to decide by random choice which groups of 
treatments will form the blocks of the design. If differences among blocks 
are large, the error variance per plot for the mean of a group of treat- 
ments which lie in the same block may be considerably higher than the 
average error variance. This additional randomization ensures that the 
average error variance may be used, in nearly all cases, for comparisons 
among groups of treatments. For further discussion, see reference (10.4). 

When a plan is repeated in order to obtain extra replications, a separate 
randomization must be made for every replicate. 
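The three steps can be sketched with Python's standard `random` module. This is an illustrative sketch rather than a prescribed procedure: the plan shown is the 3 × 3 simple lattice, the treatment names A to I are hypothetical, and the optional `seed` argument is included only so that a layout can be reproduced.

```python
import random

# Plan of the 3 x 3 simple lattice, in plan (treatment-number) form.
PLAN = [
    [(1, 2, 3), (4, 5, 6), (7, 8, 9)],
    [(1, 4, 7), (2, 5, 8), (3, 6, 9)],
]


def randomize_lattice(plan, treatments, seed=None):
    """Apply the three randomization steps to a lattice plan."""
    rng = random.Random(seed)

    # Step 3: allot the actual treatments to the plan numbers at random.
    names = list(treatments)
    rng.shuffle(names)
    label = {number: name for number, name in enumerate(names, start=1)}

    layout = []
    for blocks in plan:                       # fresh randomization per replicate
        blocks = [list(block) for block in blocks]
        rng.shuffle(blocks)                   # step 1: order of blocks
        for block in blocks:
            rng.shuffle(block)                # step 2: order of units in a block
        layout.append([[label[number] for number in block] for block in blocks])
    return layout


layout = randomize_lattice(PLAN, "ABCDEFGHI", seed=1)
for replicate in layout:
    print(replicate)
```

The grouping of plan numbers into blocks is never altered; only the order of blocks, the order of units within blocks, and the correspondence between plan numbers and actual treatments are randomized.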

10.28 Statistical Analysis. An account of the theory, with worked
examples of the simple and triple lattice, is given in reference (10.5); a
systematic presentation of the shortest computational methods, with
worked examples of the simple, quadruple, and balanced lattices in
(10.6); and a detailed description of the methods for carrying out the
computations on I.B.M. (Hollerith) punched-card machines in (10.7).
The computational methods differ slightly from one reference to another,
but all are basically the same.

Table 10.3 shows the plan and yields for a 5 X 5 simple lattice ex- 
periment on soybeans, the treatments being 25 varieties produced in the 


TABLE 10.3 Yields of a 5 × 5 simple lattice experiment on soybeans

(Yields are in bushels per acre, minus 30 bu.)

Rep. I                                                   B       C      μC
 (1)  6    (2)  7    (3)  5    (4)  8    (5)  6         32     +61    +9.5
 (6) 16    (7) 12    (8) 12    (9) 13   (10)  8         61      -8    -1.3
(11) 17   (12)  7   (13)  7   (14)  9   (15) 14         54     +48    +7.5
(16) 18   (17) 16   (18) 13   (19) 13   (20) 14         74     -15    -2.3
(21) 14   (22) 15   (23) 11   (24) 14   (25) 14         68     +17    +2.7
                                                       289    +103   +16.1

Rep. II                                                  B       C      μC
 (1) 24    (6) 13   (11) 24   (16) 11   (21)  8         80      -9    -1.4
 (2) 21    (7) 11   (12) 14   (17) 11   (22) 23         80     -23    -3.6
 (3) 16    (8)  4   (13) 12   (18) 12   (23) 12         56      -8    -1.3
 (4) 17    (9) 10   (14) 30   (19)  9   (24) 23         89     -32    -5.0
 (5) 15   (10) 15   (15) 22   (20) 16   (25) 19         87     -31    -4.8
                                                       392    -103   -16.1

Treatment totals (unadj.)                                μC
 (1) 30    (2) 28    (3) 21    (4) 25    (5) 21        +9.5
 (6) 29    (7) 23    (8) 16    (9) 23   (10) 23        -1.3
(11) 41   (12) 21   (13) 19   (14) 39   (15) 36        +7.5
(16) 29   (17) 27   (18) 25   (19) 22   (20) 30        -2.3
(21) 22   (22) 38   (23) 23   (24) 37   (25) 33        +2.7
μC  -1.4      -3.6      -1.3      -5.0      -4.8        0.0

Treatment totals (adj.)
 (1) 38.1    (2) 33.9    (3) 29.2    (4) 29.5    (5) 25.7
 (6) 26.3    (7) 18.1    (8) 13.4    (9) 16.7   (10) 16.9
(11) 47.1   (12) 24.9   (13) 25.2   (14) 41.5   (15) 38.7
(16) 25.3   (17) 21.1   (18) 21.4   (19) 14.7   (20) 22.9
(21) 23.3   (22) 37.1   (23) 24.4   (24) 34.7   (25) 30.9

soybean breeding program of the North Carolina Experiment Station.
The experiment actually contained 4 replications, obtained by duplicating
the plan for the simple lattice. The first 2 replications will serve to
illustrate the analysis for simple, triple, and quadruple lattices, which all
follow the same general pattern. In section 10.29 the data from the
whole experiment will be analyzed in order to indicate the procedure
when the basic plan is repeated.

1. Find the block totals B, the replication totals, the treatment totals,
and the grand total.

2. For each block calculate

C = total (over all replicates) of all treatments in the block - rB

For example, for block 1 in replication 1,

C = 30 + 28 + 21 + 25 + 21 - (2)(32) = +61

Find the replicate totals $R_c$ of the C values. These replicate totals
should add to zero.

3. The analysis of variance is as follows.

                                       d.f.                        s.s.       m.s.
Replications                           (r - 1)               1    212.18
Treatments (unadj.)                    (k^2 - 1)            24    559.28
Blocks within replications (adj.)      r(k - 1)              8    501.84    62.73  E_b
Intra-block error                      (k - 1)(rk - k - 1)  16    218.48    13.66  E_e
Total                                  (rk^2 - 1)           49   1491.78

All sums of squares are found in the usual way except that for blocks
(adjusted), which is given by

$$\frac{\sum C^2}{kr(r - 1)} - \frac{\sum R_c^2}{k^2 r(r - 1)} = \frac{(61)^2 + (8)^2 + \cdots + (31)^2}{10} - \frac{(103)^2 + (103)^2}{50} = 501.84$$

where $R_c$ denotes a replication total of the C's.

4. The weighting factor used to obtain the adjusted treatment totals is

$$\mu = \frac{E_b - E_e}{k(r - 1)E_b} = \frac{62.73 - 13.66}{(5)(62.73)} = 0.1564$$

where $E_b$ and $E_e$ are, respectively, the mean squares for blocks and intra-block
error. If $E_b$ is less than $E_e$, the factor is taken as zero and no adjustments
are made for block effects, the experiment being analyzed as if
in randomized blocks.

Each C value is now multiplied by $\mu$ to obtain the block corrections
$\mu C$. These should sum to zero, apart from rounding errors. Each treatment
total is adjusted by applying the appropriate correction for every
block in which the treatment appears. For example, the adjusted total
for treatment 4 is

25 + 9.5 - 5.0 = 29.5

In the case of the simple lattice it pays to write the $\mu C$ values around the
table of unadjusted treatment totals, putting the values for replication
1 in the right-hand column and those for replication 2 in the bottom row.
Then the adjustments to any treatment total are at the end of the row
and the bottom of the column in which the treatment lies.

5. The error variance of the difference between two treatment means
is slightly smaller for treatments that appear in the same block than for
those that do not. The formulae are:

Two treatments in the same block:

$$\frac{2E_e}{r}[1 + (r - 1)\mu] = \frac{(2)(13.66)}{2}[1 + 0.1564] = 15.80$$

Two treatments not in the same block:

$$\frac{2E_e}{r}[1 + r\mu] = \frac{(2)(13.66)}{2}[1 + (2)(0.1564)] = 17.93$$

Average:

$$\frac{2E_e}{r}\left[1 + \frac{rk\mu}{k + 1}\right] = \frac{(2)(13.66)}{2}\left[1 + \frac{(2)(5)(0.1564)}{6}\right] = 17.22$$

The corresponding standard errors are 3.97, 4.23, and 4.15, respectively.
Except with small designs it is sufficient to use the average value,
4.15, for all t-tests between pairs of treatments.

6. The analysis of variance does not supply an F-test of the adjusted
treatment totals. A test of the unadjusted treatment totals can be obtained
by analyzing the data as if the experiment were in randomized
blocks. The error m.s. is the pooled mean square for blocks and intra-block
error, as shown below.

                         d.f.                       s.s.      m.s.
Treatments (unadj.)      (k^2 - 1)           24    559.28    23.30
Error                    (k^2 - 1)(r - 1)    24    720.32    30.01

This test is not fully sensitive, since it is based on unadjusted treatment
totals, but it will be sufficient in cases where an F-test is of secondary
interest or where differences among incomplete blocks are small.

An F-test of the adjusted treatment totals requires a little more calculation.
This is most easily done by changing the unadjusted treatments
s.s. (which is already available) so that it may be tested against
the intra-block error m.s. Calculate $B_u$, the unadjusted sum of squares
for blocks within replications, which comes to 350.00. If $B_a$ is the adjusted
sum of squares for blocks within replications (501.84), we subtract
the following quantity from the treatments s.s.

$$k(r - 1)\mu\left[\frac{r}{(r - 1)(1 + k\mu)}B_u - B_a\right] = (5)(1)(0.1564)\left[\frac{2}{1 + (5)(0.1564)}(350.00) - (501.84)\right] = -85.30$$

In this case the subtraction term is negative, which is rather unusual.
We add 85.30 to the unadjusted treatments s.s. The F-test is then completed
by the following analysis.



d.f. 

s.s. 

m.s. 

Treatments 

24 

644.58 

26.86 

Intra-block error 

16 

218.48 

13.66 


The F-ratio, 1.97, lies between the 10 and the 5% levels. The F value 
differs considerably from that obtained by the randomized blocks test, 
as may happen when the adjustment for incomplete block effects results 
in a substantial increase in precision. 

7. The error m.s. in the randomized blocks analysis was found to be
30.01. To estimate the gain in accuracy over randomized blocks, we
compare this figure with the effective error variance

$$E_e\left[1 + \frac{rk\mu}{k + 1}\right] = (13.66)\left[1 + \frac{(2)(5)(0.1564)}{6}\right] = 17.22$$


The relative accuracy is 30.01/17.22, or 174%. 
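The whole computation for the first two replications can be sketched in Python. The sketch is illustrative, not a reproduction of the computing methods in the references: the variable names are assumptions, the yields are those of table 10.3, and the layout takes the rows of the 5 × 5 square as the blocks of replication I and the columns as the blocks of replication II.

```python
k, r = 5, 2

# Table 10.3. Y1[i][j]: yield in rep. I of the treatment in row i, column j.
Y1 = [[6, 7, 5, 8, 6], [16, 12, 12, 13, 8], [17, 7, 7, 9, 14],
      [18, 16, 13, 13, 14], [14, 15, 11, 14, 14]]
# Y2[j][i]: yield in rep. II of the same treatment (block j holds column j).
Y2 = [[24, 13, 24, 11, 8], [21, 11, 14, 11, 23], [16, 4, 12, 12, 12],
      [17, 10, 30, 9, 23], [15, 15, 22, 16, 19]]

# Steps 1 and 2: totals and the C value of every block.
T = [[Y1[i][j] + Y2[j][i] for j in range(k)] for i in range(k)]
B1 = [sum(block) for block in Y1]
B2 = [sum(block) for block in Y2]
C1 = [sum(T[i]) - r * B1[i] for i in range(k)]
C2 = [sum(T[i][j] for i in range(k)) - r * B2[j] for j in range(k)]
R = [sum(B1), sum(B2)]
G = sum(R)
cf = G * G / (r * k * k)

# Step 3: analysis of variance.
ss_total = sum(y * y for block in Y1 + Y2 for y in block) - cf
ss_reps = sum(x * x for x in R) / (k * k) - cf
ss_treat = sum(t * t for row in T for t in row) / r - cf
ss_blocks = (sum(c * c for c in C1 + C2) / (k * r * (r - 1))
             - (sum(C1) ** 2 + sum(C2) ** 2) / (k * k * r * (r - 1)))
ss_error = ss_total - ss_reps - ss_treat - ss_blocks
Eb = ss_blocks / (r * (k - 1))
Ee = ss_error / ((k - 1) * (r * k - k - 1))

# Step 4: weighting factor and adjusted treatment totals.
mu = (Eb - Ee) / (k * (r - 1) * Eb) if Eb > Ee else 0.0
adjusted = [[T[i][j] + mu * (C1[i] + C2[j]) for j in range(k)] for i in range(k)]

# Step 5: error variances of a difference between two treatment means.
var_same = 2 * Ee / r * (1 + (r - 1) * mu)
var_other = 2 * Ee / r * (1 + r * mu)
var_avg = 2 * Ee / r * (1 + r * k * mu / (k + 1))

# Step 6: F-test of the adjusted treatment totals.
Bu = sum(b * b for b in B1 + B2) / k - sum(x * x for x in R) / (k * k)
subtract = k * (r - 1) * mu * (r * Bu / ((r - 1) * (1 + k * mu)) - ss_blocks)
F_adj = (ss_treat - subtract) / (k * k - 1) / Ee

# Step 7: accuracy relative to randomized blocks.
rb_error = (ss_blocks + ss_error) / ((k * k - 1) * (r - 1))
relative_accuracy = rb_error / (Ee * (1 + r * k * mu / (k + 1)))

print(round(mu, 4), round(var_avg, 2), round(F_adj, 2), round(relative_accuracy, 2))
```

The results differ from the printed figures only in the last digit, since the hand computation rounds $E_e$ and $\mu$ before they are used again.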


10.29 Statistical Analysis for Repetitions of the Designs. Suppose
that the basic design contains n replicates (where, e.g., n = 2 for a simple
lattice) and that this is repeated p times, so that the total number of
replications is r = np. For illustration, we present a joint analysis of
the data in tables 10.3 and 10.4 (pp. 275 and 289), which show 4 replications
of the 5 × 5 simple lattice. Replication III is a repetition of
replication I, and IV of II. The field results have been rearranged so
that treatments follow the same order within corresponding replications.
In terms of our notation, n = 2, p = 2, and r = 4.

1. Calculate the block totals, the replication totals, the treatment 
totals (shown in table 10.4), and the grand total. 

2. Arrange the block totals in supplementary tables as on the right of
table 10.4. In these tables, blocks that contain the same set of k treatments
are placed in the same row. Each table has k rows and p columns,
while the number of tables is n. Form the row and column totals of each
table. For each group of similar blocks, compute the quantities

C = total (over all replicates) of the treatments appearing in the
group - n (total of the group of blocks)

Thus for the first block in replications II and IV, which contains treatments
(1), (6), (11), (16), and (21),

C « 59 + 53 + SO + 03 + 5S - 2(157) - -3 

Find the replication totals li c of the V values; these should add to zero. 
3. The analysis of variance is as follows.

                                      d.f.                      s.s.      m.s.
Replications                        (r - 1)                3    226.19
Treatments (unadj.)                 (k² - 1)              24    791.24
Blocks within replications (adj.)   r(k - 1)              16    786.00    49.12  E_b
  Component (a)                     n(p - 1)(k - 1)        8    164.72
  Component (b)                     n(k - 1)               8    621.28
Intra-block error                   (k - 1)(rk - k - 1)   56    761.56    13.60  E_e
Total                               (rk² - 1)             99   2564.99


The blocks s.s. contains two components, both of which are obtained 
from the supplementary table of block totals. 

Component (a) is a new component, which arises only when the design 
has been repeated. It is composed of the differences between the totals 
of blocks that contain the same set of treatments. Since each supple- 
mentary table has k rows and p columns, it may be analyzed as follows. 


                    d.f.
Rows                (k - 1)
Columns             (p - 1)
Rows × columns      (k - 1)(p - 1)
Total               (kp - 1)



The sum of squares for component (a) is the sum of the rows × columns
interactions over all n tables. It may be obtained from the following
calculations, in which the divisors are k = 5, pk = 10, k² = 25, and
pk² = 50.

$$\text{Total:}\quad \frac{(32)^2 + (61)^2 + \cdots + (75)^2 + (84)^2}{5} - \frac{(650)^2 + (749)^2}{50} = 602.18$$

$$\text{Rows:}\quad \frac{(104)^2 + (142)^2 + \cdots + (171)^2}{10} - \frac{(650)^2 + (749)^2}{50} = 309.28$$

$$\text{Columns:}\quad \frac{(289)^2 + (361)^2 + (392)^2 + (357)^2}{25} - \frac{(650)^2 + (749)^2}{50} = 128.18$$

Component (a) s.s. = 602.18 - 309.28 - 128.18 = 164.72
The above is a general method that covers all cases. With a simple
lattice, a quicker alternative is to calculate the differences between the
totals of similar blocks. The sum of squares of deviations of these
differences from their replication means, divided by 2k, gives the sum of
squares for component (a).
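
Both routes to component (a) can be traced in a few lines. The sketch below assumes the block totals of the supplementary tables in table 10.4; the function names are illustrative and not part of the text.

```python
K = 5  # block size

# Supplementary tables of block totals: each row is one group of similar
# blocks, each column one repetition (p = 2).
tables = [
    [(32, 72), (61, 81), (54, 65), (74, 75), (68, 68)],  # reps. I and III
    [(80, 77), (80, 57), (56, 64), (89, 75), (87, 84)],  # reps. II and IV
]

def component_a_general(tables, k):
    """Rows x columns interaction s.s., summed over the n tables."""
    p = len(tables[0][0])
    corr = sum(sum(b for row in t for b in row) ** 2 for t in tables) / (p * k * k)
    total = sum(b * b for t in tables for row in t for b in row) / k - corr
    rows = sum(sum(row) ** 2 for t in tables for row in t) / (p * k) - corr
    cols = sum(sum(row[j] for row in t) ** 2
               for t in tables for j in range(p)) / (k * k) - corr
    return total - rows - cols

def component_a_quick(tables, k):
    """Simple-lattice shortcut: deviations of the differences, divided by 2k."""
    ss = 0.0
    for t in tables:
        d = [a - b for a, b in t]
        mean = sum(d) / len(d)
        ss += sum((x - mean) ** 2 for x in d)
    return ss / (2 * k)

print(round(component_a_general(tables, K), 2))
print(round(component_a_quick(tables, K), 2))
```

The two functions should agree, since with p = 2 the interaction in each table depends only on the differences between the paired block totals.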

Component (b) is the component that is present even when there are
no repetitions.

$$\text{Component (b)} = \frac{\sum C^2}{kr(n - 1)} - \frac{\sum R_c^2}{k^2 r(n - 1)} = \frac{(63)^2 + (29)^2 + \cdots + (48)^2}{20} - \frac{(99)^2 + (99)^2}{100} = 621.28$$


The sum of squares for blocks within replications (adjusted) is the pooled 
sum of squares for the two components. All other terms in the analysis 
of variance are computed in the usual way. 
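
As a check on component (b) and on the pooled blocks s.s., the following sketch uses the C values listed in table 10.4. Reading the printed divisors 20 and 100 as kr(n - 1) and k²r(n - 1) is an assumption made here; for this example either reading gives the same numbers.

```python
k, r, n = 5, 4, 2

# C values for the groups of similar blocks (table 10.4).
c_values = {
    "I and III": [63, -29, 69, -34, 30],
    "II and IV": [-3, 2, -1, -49, -48],
}

sum_c2 = sum(c * c for group in c_values.values() for c in group)
sum_rc2 = sum(sum(group) ** 2 for group in c_values.values())  # R_c totals: +99, -99

component_b = sum_c2 / (k * r * (n - 1)) - sum_rc2 / (k * k * r * (n - 1))
blocks_ss = 164.72 + component_b   # component (a) + component (b)
e_b = blocks_ss / (r * (k - 1))    # pooled mean square for blocks, 16 d.f.

print(round(component_b, 2), round(blocks_ss, 2), round(e_b, 2))
```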

4. If $E_b$ is the pooled mean square for blocks, and $E_e$ that for error,
the weighting factor is

$$\mu = \frac{p(E_b - E_e)}{k[(r - p)E_b + (p - 1)E_e]} = \frac{(2)(35.52)}{(5)[(2)(49.12) + 13.60]} = 0.1270$$

where it will be recalled that p is the number of repetitions of the basic
design, in this case 2. As usual, no adjustments are made if $E_b$ is less
than $E_e$.

The block corrections μC are obtained and entered in table 10.4; they
should add to zero except for rounding errors. Each treatment total is
adjusted by applying a correction for each group of blocks in which the
treatment appears. For instance, treatment (13) appears in the third
block in replications I and III and in the third block in replications II
and IV. Its adjusted total is

44 + 8.8 - 0.1 = 52.7
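
The weighting factor and the adjustment of one treatment total can be reproduced as follows. This is a minimal sketch; returning zero when E_b does not exceed E_e is one way of expressing "no adjustments are made", and the function name is illustrative.

```python
def weighting_factor(e_b, e_e, k, r, p):
    """mu = p(E_b - E_e) / (k[(r - p)E_b + (p - 1)E_e]); zero if E_b <= E_e."""
    if e_b <= e_e:
        return 0.0  # assumption: "no adjustment" is represented by mu = 0
    return p * (e_b - e_e) / (k * ((r - p) * e_b + (p - 1) * e_e))

mu = weighting_factor(49.12, 13.60, k=5, r=4, p=2)

# Treatment (13): unadjusted total 44; the C values of its two groups of
# blocks are +69 (reps. I and III) and -1 (reps. II and IV).
corrections = [round(mu * c, 1) for c in (69, -1)]
adjusted_13 = 44 + sum(corrections)

print(round(mu, 4), corrections, round(adjusted_13, 1))
```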

5. The error variances for the difference between two treatment means
are:

Two treatments in the same block:

$$\frac{2E_e}{r}\left[1 + (n - 1)\mu\right] = \frac{(2)(13.66)}{4}\left[1 + 0.1270\right] = 7.70$$

Two treatments not in same block:

$$\frac{2E_e}{r}\left[1 + n\mu\right] = \frac{(2)(13.66)}{4}\left[1 + (2)(0.1270)\right] = 8.56$$

Average:

$$\frac{2E_e}{r}\left[1 + \frac{nk\mu}{k + 1}\right] = \frac{(2)(13.66)}{4}\left[1 + \frac{(2)(5)(0.1270)}{6}\right] = 8.28$$
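
The three error variances of step 5 follow one pattern, which the sketch below makes explicit. The function name is illustrative; the value 13.66 for E_e is the one that appears in the printed arithmetic for the average variance.

```python
def lattice_error_variances(e_e, r, n, k, mu):
    """Variances of a difference of two adjusted means (same block, not same block, average)."""
    base = 2 * e_e / r
    same_block = base * (1 + (n - 1) * mu)
    different_block = base * (1 + n * mu)
    average = base * (1 + n * k * mu / (k + 1))
    return same_block, different_block, average

same, diff, avg = lattice_error_variances(13.66, r=4, n=2, k=5, mu=0.1270)
print(round(same, 2), round(diff, 2), round(avg, 2))
```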
TABLE 10.4 A 5 × 5 simple lattice (reps. III and IV)

                                                          Block totals
Rep. III                                        Total      I    III    Sum      C     μC
(1) 13   (2) 26   (3)  9   (4) 13   (5) 11        72      32     72    104    +63   +8.0
(6) 15   (7) 18   (8) 22   (9) 11   (10) 15       81      61     81    142    -29   -3.7
(11) 19  (12) 10  (13) 10  (14) 10  (15) 16       65      54     65    119    +69   +8.8
(16) 21  (17) 16  (18) 17  (19)  4  (20) 17       75      74     75    149    -34   -4.3
(21) 15  (22) 12  (23) 13  (24) 20  (25)  8       68      68     68    136    +30   +3.8
                                                 361     289    361    650    +99  +12.6

Rep. IV                                         Total     II     IV    Sum      C     μC
(1) 16   (6)  7   (11) 20  (16) 13  (21) 21       77      80     77    157     -3   -0.4
(2) 15   (7) 10   (12) 11  (17)  7  (22) 14       57      80     57    137     +2   +0.3
(3)  7   (8) 11   (13) 15  (18) 15  (23) 16       64      56     64    120     -1   -0.1
(4) 19   (9) 14   (14) 20  (19)  6  (24) 16       75      89     75    164    -49   -6.2
(5) 17   (10) 18  (15) 20  (20) 15  (25) 14       84      87     84    171    -48   -6.1
                                                 357     392    357    749    -99  -12.5

Treatment totals (4 reps.)                                  μC
(1) 59    (2) 69    (3) 37    (4) 57    (5) 49            +8.0
(6) 51    (7) 51    (8) 49    (9) 48    (10) 56           -3.7
(11) 80   (12) 42   (13) 44   (14) 69   (15) 72           +8.8
(16) 63   (17) 50   (18) 57   (19) 32   (20) 62           -4.3
(21) 58   (22) 64   (23) 52   (24) 73   (25) 55           +3.8
μC: -0.4      +0.3      -0.1      -6.2      -6.1

Adjusted treatment totals
(1) 66.6    (2) 77.3    (3) 44.9    (4) 58.8    (5) 50.9
(6) 46.9    (7) 47.6    (8) 45.2    (9) 38.1    (10) 46.2
(11) 88.4   (12) 51.1   (13) 52.7   (14) 71.6   (15) 74.7
(16) 58.3   (17) 46.0   (18) 52.6   (19) 21.5   (20) 51.6
(21) 61.4   (22) 68.1   (23) 55.7   (24) 70.6   (25) 52.7


6. An approximate F-test of the adjusted treatment totals is obtained 
by the method described in the previous section, step 6. In this case the 
quantity that must be subtracted from the treatments s.s. is 

where $B_u$ is the unadjusted and $B_a$ the adjusted sum of squares for
component (b) of the blocks. Both quantities have already been computed:
$B_a$, 621.28, appears in the analysis of variance, while $B_u$, 309.28, is the
rows s.s. used in computing component (a).
7. A comparison with randomized blocks is made as in the previous
section, step 7. The effective error variance of the lattice is taken as

$$E_e\left[1 + \frac{nk\mu}{k + 1}\right]$$


10.210 Missing Data. If the missing observations are numerous, an
analysis by the method for randomized blocks is recommended. If it is
desired to carry out the full analysis, missing values are estimated by the
formula which minimizes the intra-block error s.s., though as explained
in section 10.13 this method is only approximate.

Experiments with no repetition of the basic design.

$$x = \frac{(r - 1)k^2T - rR + G - rkC + kC'}{(r - 1)(k - 1)(rk - k - 1)} \qquad (10.2)$$

In this formula T and R are the totals for the treatment and replicate
that contain the missing value, G is the grand total, and C is the C value
(defined in section 10.28) for the block which contains the missing
observation. The only unfamiliar quantity is C':

C' = total of the C values for all blocks which contain the
treatment that has the missing value


Example. For the experiment analyzed in section 10.28, suppose that
3 observations are missing: treatments (1) and (12) in replication I and
treatment (14) in replication II. Denote the estimates of the 3 missing
values by x, y, and z, respectively; these will be obtained by the method
of successive approximation.

It is best to start by finding all block, replicate, and treatment totals
and all C values, as in steps 1 and 2 of the computing instructions for
this design. Wherever a missing observation is involved, include the
appropriate x, y, or z in its place. The summary totals are shown in table
10.5. Note that the total of the C values over the whole experiment is
identically zero, so that a good check is available on this part of the
calculations.

TABLE 10.5 Summary totals for a 5 × 5 simple lattice with three
missing values

Treatment totals
(1) 24 + x    (2) 28         (3) 21     (4) 25        (5) 21
(6) 29        (7) 23         (8) 16     (9) 23        (10) 23
(11) 41       (12) 14 + y    (13) 19    (14) 9 + z    (15) 36
(16) 29       (17) 27        (18) 25    (19) 22       (20) 30
(21) 22       (22) 38        (23) 23    (24) 37       (25) 33

            Rep. I                               Rep. II
      B                 C                  B               C
   26 + x            67 - x               80            -15 + x
   61                -8                   80            -30 + y
   47 + y            25 - y + z           56            -8
   74                -15                  59 + z        -2 - z
   68                17                   87            -31
R_1 = 276 + x + y    86 - x - y + z    R_2 = 362 + z    -86 + x + y - z

G = R_1 + R_2 = 638 + x + y + z

We must now find first approximations for y and z. In the replications
in which they are present, their values are 14 and 9, respectively.
However, replication II appears to have higher values than replication I, and
it is probably worth while to make an adjustment for replication effect.
If we ignore the missing values, the means per plot are about 12 for
replication I and 15 for replication II. Since y is missing in replication
I, we subtract 3 from its value in replication II, giving 11. For z we add
3, giving 12 as the first approximation. Using y = 11, z = 12, we now
solve for x from formula (10.2). For r = 2, k = 5 the formula becomes

$$x = \frac{25T - 2R + G - 10C + 5C'}{16}$$

Put y = 11, z = 12. From table 10.5 we find for x

T = 24; R = 276 + 11 = 287; G = 638 + 11 + 12 = 661
C = 67; C' = 67 - 15 = 52

$$x = \frac{(25)(24) - (2)(287) + 661 - (10)(67) + (5)(52)}{16} = \frac{277}{16} = 17$$

Put x = 17, z = 12. For y we have

T = 14; R = 276 + 17 = 293; G = 638 + 17 + 12 = 667
C = 25 + 12 = 37; C' = 25 + 12 - 30 = 7

$$y = \frac{(25)(14) - (2)(293) + 667 - (10)(37) + (5)(7)}{16} = \frac{96}{16} = 6$$

Put x = 17, y = 6. For z we have

T = 9; R = 362; G = 638 + 17 + 6 = 661; C = -2
C' = -2 + 25 - 6 = 17

$$z = \frac{(25)(9) - (2)(362) + 661 - (10)(-2) + (5)(17)}{16} = \frac{267}{16} = 17$$

This completes the first round. A second round leads to x = 18, y =
5, z = 17, and there is obviously no need for further calculation. The
rest of the analysis proceeds as usual except that 3 d.f. are omitted from
the total s.s. and the intra-block error s.s. For approximate t-tests, see
section 10.13.
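
The successive approximation can be followed mechanically. The sketch below is one way to organize it, assuming the summary totals of table 10.5 and assuming (as in table 10.4) that replication I blocks hold consecutive treatment numbers while replication II blocks hold every fifth number. All names are illustrative; estimates are rounded to whole numbers at each step, as in the text.

```python
K, R = 5, 2  # block size and number of replications

# Treatment totals over reps. I and II with the missing plots left out.
T_KNOWN = [24, 28, 21, 25, 21, 29, 23, 16, 23, 23, 41, 14, 19,
           9, 36, 29, 27, 25, 22, 30, 22, 38, 23, 37, 33]
# Block totals with the missing plots left out.
B_KNOWN = {1: [26, 61, 47, 74, 68], 2: [80, 80, 56, 59, 87]}

def block_of(t, rep):
    """Index of the block holding treatment t in the given replication."""
    return (t - 1) // K if rep == 1 else (t - 1) % K

def members(rep, b):
    """Treatment numbers in block b of the given replication."""
    if rep == 1:
        return [b * K + j + 1 for j in range(K)]
    return [j * K + b + 1 for j in range(K)]

def estimate(target, est):
    """Formula (10.2) for one missing plot, the others held at their estimates."""
    T = list(T_KNOWN)
    B = {rep: list(v) for rep, v in B_KNOWN.items()}
    for (t, rep), value in est.items():
        if (t, rep) != target:
            T[t - 1] += value
            B[rep][block_of(t, rep)] += value
    t0, rep0 = target

    def c_value(rep, b):
        return sum(T[m - 1] for m in members(rep, b)) - R * B[rep][b]

    G = sum(B[1]) + sum(B[2])
    R_tot = sum(B[rep0])
    C = c_value(rep0, block_of(t0, rep0))
    C_prime = sum(c_value(rep, block_of(t0, rep)) for rep in (1, 2))
    num = (R - 1) * K * K * T[t0 - 1] - R * R_tot + G - R * K * C + K * C_prime
    return num / ((R - 1) * (K - 1) * (R * K - K - 1))

# Keys are (treatment, replication); starting values y = 11, z = 12.
est = {(1, 1): 0, (12, 1): 11, (14, 2): 12}
rounds = []
for _ in range(2):
    for key in est:
        est[key] = round(estimate(key, est))
    rounds.append(tuple(est.values()))

print(rounds)  # (x, y, z) after each round
```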

Experiments with repetitions of the basic design. If the basic design has
n replicates and these are repeated p times to give r = np replications,
the formula becomes

$$x = \frac{(n - 1)k^2T + (n - 1)rR + G - nkC + kC' - n^2R'}{(n - 1)(k - 1)(rk - k - 1)} \qquad (10.3)$$

All symbols have the same meaning as in (10.2) except that C and C' are
now derived from the totals of groups of similar blocks, just as in the
statistical analysis for this case in section 10.29. The new quantity R'
is defined as

R' = total of all replications that are similar to the replication
containing the missing value

Note that this total includes the replication with the missing value.

Example. In the 5 × 5 simple lattice with 4 replications, we have n =
2, r = 4. If treatment (1) is missing in the first replication, the reader
may verify that

T = 53; R = 283; G = 1393; C = 69; C' = 69 - 9 = 60
R' = 283 + 361 = 644

This gives

$$x = \frac{(25)(53) + (4)(283) + 1393 - (10)(69) + (5)(60) - (4)(644)}{56} = \frac{884}{56} = 16$$
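
A small sketch of formula (10.3) is given below. The denominator (n - 1)(k - 1)(rk - k - 1) is a reconstruction: it equals the printed divisor 56 in the example and reduces to the denominator of (10.2) when there are no repetitions. The function and argument names are illustrative.

```python
def missing_value_repeated(T, R, G, C, C_prime, R_prime, n, r, k):
    """Missing-plot estimate for a lattice whose basic design is repeated."""
    num = ((n - 1) * k * k * T + (n - 1) * r * R + G
           - n * k * C + k * C_prime - n * n * R_prime)
    den = (n - 1) * (k - 1) * (r * k - k - 1)  # assumed form of the divisor
    return num / den

# Worked example: treatment (1) missing in replication I, n = 2, r = 4, k = 5.
x = missing_value_repeated(53, 283, 1393, 69, 60, 644, n=2, r=4, k=5)
print(round(x))

# With no repetitions (n = r, R' = R) the expression reduces to formula (10.2).
x_plain = missing_value_repeated(24, 287, 661, 67, 52, 287, n=2, r=2, k=5)
```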


10.3 Rectangular Lattices

10.31 Description. These designs were developed recently by
Harshbarger (10.8) for k(k + 1) treatments in blocks of k units. They form a
useful addition to the square lattices described in previous sections, since
the allowable numbers of treatments, 12, 20, 30, 42, 56, 72, etc., fall about
midway between the allowable numbers for square lattices. The
statistical analysis is quite similar to that for simple and triple lattices,
though it takes more time because the block adjustments are not so
simple as with square lattices. The new designs are less symmetrical
than the square lattices, in the sense that there is a greater variation in
the accuracy with which two treatment means are compared. It will
be recalled that only two standard errors are required for t-tests with the
simple and triple lattice: one for two treatments that appear in the same
block and one for two treatments that do not. The simple rectangular
lattice requires four standard errors, while the triple rectangular lattice
requires seven. For practical purposes it appears that these can nearly
always be reduced to two.

There are several ways of constructing the designs. One (suggested
by G. S. Watson) is by means of a latin square with (k + 1) rows and
columns, in which every letter in the leading diagonal is different. When
writing down the square we attach a number to all letters except those
in the leading diagonal, as illustrated below for a 4 × 4 square.

A      B1     C2     D3
D4     C      B5     A6
B7     A8     D      C9
C10    D11    A12    B

In the first replication we place in a block all numbers that lie in the
same row of the latin square, in the second replication all numbers that
lie in the same column, and in the third all numbers that have the same
latin letter.


Block     Rep. I         Rep. II        Rep. III
  1       1   2   3      1   8  11      1   5   7
  2       4   5   6      2   5  12      2   9  10
  3       7   8   9      3   6   9      3   4  11
  4      10  11  12      4   7  10      6   8  12



The important feature of this arrangement is that no two treatments 
are in the same block more than once. The use of a latin square with 
different letters down the leading diagonal ensures that in the third 
replication the three numbers associated with any letter are all in dif- 
ferent rows and columns, and hence have not previously occurred to- 
gether in a block. 
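
The construction from a latin square is easy to mechanize. The sketch below uses the 4 × 4 square shown earlier, numbers the off-diagonal cells row by row, and checks that no pair of treatments shares a block more than once. Function names are illustrative; the order of the blocks within a replication may differ from the printed plan.

```python
from collections import Counter
from itertools import combinations

square = ["ABCD", "DCBA", "BADC", "CDAB"]  # leading diagonal: A, C, D, B

def rectangular_lattice(square):
    """Return the three replications (lists of blocks) built from the square."""
    m = len(square)
    number, nxt = {}, 1
    for i in range(m):
        for j in range(m):
            if i != j:              # diagonal cells carry no treatment
                number[(i, j)] = nxt
                nxt += 1
    rows = [[number[(i, j)] for j in range(m) if j != i] for i in range(m)]
    cols = [[number[(i, j)] for i in range(m) if i != j] for j in range(m)]
    letters = sorted(set("".join(square)))
    by_letter = [[number[(i, j)] for i in range(m) for j in range(m)
                  if i != j and square[i][j] == letter] for letter in letters]
    return rows, cols, by_letter

def max_concurrence(reps):
    """Largest number of blocks in which any pair of treatments meets."""
    seen = Counter()
    for rep in reps:
        for block in rep:
            seen.update(combinations(sorted(block), 2))
    return max(seen.values())

rep1, rep2, rep3 = rectangular_lattice(square)
print(rep1, rep2, rep3, sep="\n")
print(max_concurrence([rep1, rep2, rep3]))
```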

Plans 10.10-10.16 were constructed by this method. By using the
first 2 replications from any plan we obtain a rectangular lattice in 2
replications, which by means of repetitions can be used for an
experiment in 4, 6, 8, etc., replications. By using all 3 replications of the plan
we have designs for 3, 6, 9, etc., replications.

Alternatively (as pointed out by S. S. Shrikhande), we may construct
the designs from a balanced lattice with (k + 1)² treatments. If the
first replication is omitted, and if all treatments that appear in any one
selected block in the first replicate are omitted in subsequent
replications, it is easy to verify that we generate a design for k(k + 1)
treatments in which every block is of size k and in which no two treatments
appear together more than once in the same block. This method gives
any number of replicates up to k, though of course the method fails when
no balanced lattice exists, as with 36 treatments. The discussion in this
section is limited to simple and triple rectangular lattices.

The method of randomizing is the same as for lattices (section 10.27).

10.32 Statistical Analysis. In each plan, the three basic replications
are described as the X, Y, and Z replications. Further, if we take any
block in one replication and examine another replication, we find that
there is one and only one block in the other replication that has no
treatments in common with the chosen block. These two blocks will be called
partners. Since it is important to be able to distinguish partners in the
statistical analysis, each block is denoted in the plan by two symbols,
e.g., Y2, one to mark the replication and one to indicate the partners.
All partners carry the same number; thus the partners of Y2 are X2 and
Z2. It is advisable to write the block labels in the notebook in which
the original results are recorded.

Example. The example is a triple rectangular lattice for 12 treatments
in blocks of 3 (k = 3). Artificial data were assembled by taking true
treatment effects as shown in table 10.6, and adding to them true block
effects as given with each block in table 10.7. Thus the observation 7
for treatment (10) in block X4 of table 10.7 is obtained by adding the
true treatment effect, 1, to the true block effect, 6. Since the average of
the true block effects comes out to be 56/12, or 14/3, the true treatment
totals, over the 3 replicates, will be (3τ + 14), where τ is the true
treatment effect. These figures are shown in the column at the right in table
10.6.

TABLE 10.6 Treatment effects in artificial data for a 3 × 4
triple rectangular lattice

Treatment    True effect    True treatment total
    1            14                 56
    2             7                 35
    3             2                 20
    4             0                 14
    5             3                 23
    6            11                 47
    7             9                 41
    8            16                 62
    9             8                 38
   10             1                 17
   11             5                 29
   12             6                 32

TABLE 10.7 Plan and observations for a 3 × 4 rectangular lattice
(Treatment numbers are enclosed in parentheses)

Block     True block                                      Totals        Adjustment
symbol    effect       Rep. X                             B      C_x    factor
X4          6          (10)  7   (12) 12   (11) 11        30     -12     -0.6
X1          2          (2)   9   (3)   4   (1)  16        29      20     +3.4
X3          7          (7)  16   (9)  15   (8)  23        54      -5     -1.6
X2          0          (4)   0   (5)   3   (6)  11        14      30     +5.4
                                                         127      33

                       Rep. Y                             B      C_y
Y4          9          (3)  11   (6)  20   (9)  17        48     -32     -4.6
Y2          3          (1)  17   (11)  8   (8)  19        44      10     +1.4
Y3          3          (12)  9   (2)  10   (5)   6        25      10     +1.4
Y1          5          (10)  6   (4)   5   (7)  14        25       0     -0.6
                                                         142     -12

                       Rep. Z                             B      C_z
Z1          4          (8)  20   (6)  15   (12) 10        45       4     +0.2
Z2          8          (9)  16   (10)  9   (2)  15        40     -16     -3.8
Z3          1          (11)  6   (3)   3   (4)   1        10      19     +3.2
Z4          8          (5)  11   (1)  22   (7)  17        50     -28     -3.8
                                                         145     -21

Treatment totals
Treatment    1    2    3    4    5    6    7    8    9   10   11   12
Unadj.      55   34   18    6   20   46   47   62   48   22   25   31
Adj.        56   35   20   14   23   47   41   62   38   17   29   32


No intra-block error has been introduced into the data, which there- 
fore provide two checks on the method of analysis. First, in the analysis 
of variance the intra-block error s.s. should be found to be identically 
zero. Second, the adjusted treatment totals as found from the analysis 
should be exactly equal to the true treatment totals. 

The computing instructions given below apply to either the simple or 
the triple rectangular lattice without repetitions. The changes required 
when there are repetitions are given later. 
1. Find the block totals, B, the replication totals, the treatment totals,
and the grand total.

2. For each block calculate

C = total (over all replicates) of all treatments in the block - rB

Thus for block Y4,

C = 18 + 46 + 48 - (3)(48) = -32

As usual, the total of the C's is zero.

3. Arrange the C values in a supplementary table so that partners
appear in the same row. The row totals of this table, S, give the sums of
the C values for each set of partners. The column totals give values
denoted by $R_c$.

TABLE 10.8 Supplementary table of C values

Block
symbol          C_x     C_y     C_z      S      λC_x    λC_y    λC_z     μS
1                20       0       4      24      4.0     0.0     0.8     0.6
2                30      10     -16      24      6.0     2.0    -3.2     0.6
3                -5      10      19      24     -1.0     2.0     3.8     0.6
4               -12     -32     -28     -72     -2.4    -6.4    -5.6    -1.8
Totals (R_c)     33     -12     -21       0      6.6    -2.4    -4.2     0.0


4. The analysis of variance is now obtained. All sums of squares are
found in the usual way, except that for blocks, adjusted for treatments,
which is

$$\frac{\sum C^2}{r(rk - k - 1)} - \frac{\sum R_c^2}{r(k + 1)(rk - k - 1)} - \frac{\sum S^2}{r(r - 1)(k + 1)(rk - k - 1)}$$

$$= \frac{(20)^2 + (30)^2 + \cdots + (28)^2}{15} - \frac{(33)^2 + (12)^2 + (21)^2}{60} - \frac{(24)^2 + \cdots + (72)^2}{120}$$

$$= 274.0 - 27.9 - 57.6 = 188.5 \qquad (10.4)$$


TABLE 10.9 Analysis of variance

Source of variation       d.f.                         s.s.      m.s.
Replications            (r - 1)                 2       15.5
Treatments              (k² + k - 1)           11     1067.0
Blocks                  rk                      9      188.5     20.9  E_b
Intra-block error       (r - 1)(k² - 1) - k    13        0.0      0.0  E_e
Total                   (rk² + rk - 1)         35     1271.0

As anticipated, the intra-block error s.s. in table 10.9 is zero.
5. We now calculate the weighting factors used to obtain the adjusted
treatment totals. Although a general formula can be given, it is simpler
to present these separately for simple and triple designs. Two weighting
factors are necessary.*

Simple rectangular lattice

$$\lambda = \frac{r(E_b - E_e)}{r(k - 1)E_b + (rk - 2k + r)E_e} \qquad (10.5)$$

$$\mu = \frac{\lambda r(E_b - E_e)}{r(k + 1)E_b + (rk - 2k - r)E_e} \qquad (10.6)$$

Triple rectangular lattice

$$\lambda = \frac{r(E_b - E_e)}{r(2k - 1)E_b + (rk - 6k + r)E_e} \qquad (10.7)$$

$$\mu = \frac{\lambda r(E_b - E_e)}{2r(k + 1)E_b + (rk - 6k - 2r)E_e} \qquad (10.8)$$

In this example $E_e$ is zero and we have

$$\lambda = \frac{3}{(3)(5)} = 0.2; \qquad \mu = \frac{(0.2)(3)}{(2)(3)(4)} = 0.025$$


6. Complete table 10.8 by adding the columns λC_x, λC_y, λC_z, and
μS. The adjustment for any block is

λC - μS

where S is taken from the row in which C lies in table 10.8. Thus for
block X1 the adjustment is

+4.0 - 0.6 = +3.4


These adjustments are recorded on the plan in table 10.7. Note that 
the order in which blocks appear is different in tables 10.7 and 10.8; care 
must be taken to see that each adjustment is given to the appropriate 
block. 

7. Finally, each treatment total is adjusted by adding the adjustments
for every block in which the treatment appears. For instance, the
adjusted total for treatment (4) is

6 + 5.4 - 0.6 + 3.2 = 14

Every adjusted treatment total will be found to be equal to the true
treatment total as given in table 10.6.

* This method of estimating the weights is slightly different from that given by
Harshbarger. In the interests of simplicity we have used only the pooled mean
square for blocks and the intra-block error m.s. for estimating λ and μ.
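
The whole computation for the artificial example can be traced in a short sketch. It assumes the plan and observations of table 10.7 and the triple-lattice weighting factors with the coefficient of E_b in the denominator of μ taken as 2r(k + 1), which is what the printed arithmetic (2)(3)(4) suggests. Variable names are illustrative.

```python
r, k = 3, 3

# plan[block symbol] = [(treatment, observation), ...], from table 10.7.
plan = {
    "X4": [(10, 7), (12, 12), (11, 11)], "X1": [(2, 9), (3, 4), (1, 16)],
    "X3": [(7, 16), (9, 15), (8, 23)],   "X2": [(4, 0), (5, 3), (6, 11)],
    "Y4": [(3, 11), (6, 20), (9, 17)],   "Y2": [(1, 17), (11, 8), (8, 19)],
    "Y3": [(12, 9), (2, 10), (5, 6)],    "Y1": [(10, 6), (4, 5), (7, 14)],
    "Z1": [(8, 20), (6, 15), (12, 10)],  "Z2": [(9, 16), (10, 9), (2, 15)],
    "Z3": [(11, 6), (3, 3), (4, 1)],     "Z4": [(5, 11), (1, 22), (7, 17)],
}
true_effect = [14, 7, 2, 0, 3, 11, 9, 16, 8, 1, 5, 6]

# Steps 1-3: totals, C values, partner sums S and replication sums R_c.
T = {t: 0 for t in range(1, 13)}
for plots in plan.values():
    for t, y in plots:
        T[t] += y
B = {b: sum(y for _, y in plots) for b, plots in plan.items()}
C = {b: sum(T[t] for t, _ in plots) - r * B[b] for b, plots in plan.items()}
S = {i: sum(C[rep + i] for rep in "XYZ") for i in "1234"}
Rc = {rep: sum(C[rep + i] for i in "1234") for rep in "XYZ"}

# Step 4: blocks s.s. adjusted for treatments, formula (10.4).
q = r * k - k - 1
blocks_ss = (sum(c * c for c in C.values()) / (r * q)
             - sum(v * v for v in Rc.values()) / (r * (k + 1) * q)
             - sum(v * v for v in S.values()) / (r * (r - 1) * (k + 1) * q))

# Step 5: weighting factors for the triple rectangular lattice (E_e = 0 here).
E_b, E_e = blocks_ss / (r * k), 0.0
lam = r * (E_b - E_e) / (r * (2 * k - 1) * E_b + (r * k - 6 * k + r) * E_e)
mu = lam * r * (E_b - E_e) / (2 * r * (k + 1) * E_b + (r * k - 6 * k - 2 * r) * E_e)

# Steps 6-7: block adjustments and adjusted treatment totals.
adjustment = {b: lam * C[b] - mu * S[b[1]] for b in plan}
adjusted = {t: T[t] + sum(adjustment[b] for b, plots in plan.items()
                          if any(u == t for u, _ in plots)) for t in T}

print(round(blocks_ss, 1), round(lam, 3), round(mu, 3))
print({t: round(v, 1) for t, v in adjusted.items()})
```

Because no intra-block error was introduced, each adjusted total should coincide with 3τ + 14 for the corresponding true effect τ.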

Statistical analysis for repetitions of the designs. When the basic design 
with n replications is repeated p times to give r = np replications, the 
changes required in the steps of the analysis are noted below. 

1. No change. 

2a. For any block there are (p - 1) other blocks that have exactly
the same set of treatments. Arrange the totals for such similar blocks
in two-way tables, as exemplified below. There are n such tables.

X Blocks
                    Repetition               Total
Block no.       1     2    ···    p          (ΣB)
1
2
···
(k + 1)
Total

2b. For each group of similar blocks calculate

C = total (over all replicates) of all treatments in the group - n(ΣB)

3. No change.

4a. The blocks s.s. now has two components. For component (a)
obtain the interaction s.s. for each of the tables in step 2a, and add these
sums of squares. Since each block total contains k observations, a
divisor k is required for the analysis of variance.

4b. The sum of squares for component (b) is

$$\frac{\sum C^2}{r(nk - k - 1)} - \frac{\sum R_c^2}{r(k + 1)(nk - k - 1)} - \frac{\sum S^2}{r(n - 1)(k + 1)(nk - k - 1)}$$

The separation of degrees of freedom in the analysis of variance is shown
below.

                        d.f.                     m.s.
Replications            (r - 1)
Treatments              (k² + k - 1)
Blocks                  rk                       E_b
  Component (a)         (r - n)k
  Component (b)         nk
Intra-block error       (r - 1)(k² - 1) - k      E_e
Total                   (rk² + rk - 1)
5. The weighting factors in formulae (10.5)-(10.8) are unchanged.
Note that $E_b$ is the pooled mean square for blocks.

6. Unchanged.

7. Unchanged.

As mentioned previously, in order to have t-tests for every pair of
treatments we would require to present 4 standard error formulae for
simple rectangular lattices and 7 for triple rectangular lattices. With
very little inaccuracy these can be reduced to 2, one for the case where
the two treatments appear in a block and one for the case where this is
not so. For the difference between two adjusted means the estimated
error variances are:

Simple rectangular lattice

Two treatments in the same block: $\frac{2E_e}{r}(1 + \lambda - \mu)$

Two treatments not in same block: $\frac{2E_e}{r}\left(1 + 2\lambda - \tfrac{3}{2}\mu\right)$

Average:*

$$\frac{2E_e}{r}\left[\frac{2(k - 1)}{k^2 + k - 1}(1 + \lambda - \mu) + \frac{k^2 - k + 1}{k^2 + k - 1}\left(1 + 2\lambda - \tfrac{3}{2}\mu\right)\right] = \frac{2E_e}{r}\left[1 + \frac{2k^2\lambda - \tfrac{1}{2}(3k^2 + k - 1)\mu}{k^2 + k - 1}\right]$$

Triple rectangular lattice

Two treatments in the same block: $\frac{2E_e}{r}(1 + 2\lambda - \mu)$

Two treatments not in same block: $\frac{2E_e}{r}\left(1 + 3\lambda - \tfrac{3}{2}\mu\right)$

Average:

$$\frac{2E_e}{r}\left[\frac{3(k - 1)}{k^2 + k - 1}(1 + 2\lambda - \mu) + \frac{k^2 - 2k + 2}{k^2 + k - 1}\left(1 + 3\lambda - \tfrac{3}{2}\mu\right)\right]$$

In the example, where λ = 0.2, μ = 0.025, the factors multiplying the
usual term $2E_e/r$ are 1.375 for two treatments in the same block and
1.562 for two treatments not in the same block. The factor in the
average variance is 1.460.
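
The three factors quoted for the example can be recomputed directly. The sketch below assumes the triple rectangular lattice expressions 1 + 2λ - μ and 1 + 3λ - (3/2)μ, weighted by the numbers of treatments that do and do not share a block with a given treatment; the function name is illustrative.

```python
def triple_rectangular_factors(k, lam, mu):
    """Multipliers of 2*E_e/r: same block, not same block, and their average."""
    same = 1 + 2 * lam - mu
    different = 1 + 3 * lam - 1.5 * mu
    others = k * k + k - 1          # all other treatments
    n_same = 3 * (k - 1)            # those sharing a block with a given treatment
    n_diff = k * k - 2 * k + 2      # those that never share a block with it
    average = (n_same * same + n_diff * different) / others
    return same, different, average

same_f, diff_f, avg_f = triple_rectangular_factors(3, 0.2, 0.025)
print(same_f, diff_f, round(avg_f, 3))
```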


* Owing to the fact that only two standard errors were used, this average is not
quite equal to the average variance taken over all possible pairs, though it is very
close to that value.

10.4 Cubic Lattices

10.41 Description. These designs were produced by Yates for
plant-breeding work in which selections are to be made from an unusually large
number of varieties. The number of treatments must be an exact cube.
The most useful range comprises 27, 64, 125, 216, 343, 512, 729, and
1000 treatments. The size of block is the cube root of the number of
treatments, i.e., 3, 4, 5, 6, 7, 8, 9, and 10, respectively. Thus cubic
lattices can accommodate a large number of treatments in a small size of
incomplete block. The designs have been used, for example, in an
experiment with 729 strains of ponderosa pine seedlings (10.9) and an
experiment with 729 soybean varieties (10.10). The number of replicates
must be 3 or some multiple of 3.

Since the plans are easy to construct and since they occupy a
considerable amount of space for the higher numbers of treatments, they are
not reproduced here. To obtain a plan, the k³ treatments are numbered
by means of a three-digit code in which each digit takes all values from
1 to k. For 27 treatments, the code is as follows:


T*    Code      T     Code      T     Code
 1    111        4    121        7    131
 2    211        5    221        8    231
 3    311        6    321        9    331
10    112       13    122       16    132
11    212       14    222       17    232
12    312       15    322       18    332
19    113       22    123       25    133
20    213       23    223       26    233
21    313       24    323       27    333

* Treatment number.

The same principle applies with a larger number of treatments. For
the first k treatments, the last two digits are fixed at (11) while the first
digit runs from 1 to k. The next k treatments are coded by fixing the
last two digits at (21) while the first digit again runs from 1 to k, and so
on in a systematic manner until the final k treatments are reached, for
which the last two digits have the fixed values (kk).

Within each of the 3 replications, the k³ treatments are grouped into
k² blocks, each of size k. In the first replication, the rule for this
grouping is to keep the last two digits constant within a block, allowing the
first digit to take all values from 1 to k. Thus, in the example above, the
9 groups of treatments constitute the 9 blocks, block 1 containing the
treatments (111), (211), and (311).

To form blocks in the second replication, we keep the first and last
digits fixed within any block and give the second digit all values from
1 to k. With 27 treatments, the first block therefore contains (111),
(121), and (131), the second block (211), (221), and (231), and the last
block (313), (323), and (333). In the third replication, the first and
second digits are constant within each block.

The composition of the blocks in the second and third replications is
shown in table 10.10.


TABLE 10.10 Second and third replicates of a cubic lattice with
27 treatments

Rep. II
Block (1)      (2)          (3)          (4)          (5)
Code  T*       Code  T      Code  T      Code  T      Code  T
111    1       211    2     311    3     112   10     212   11
121    4       221    5     321    6     122   13     222   14
131    7       231    8     331    9     132   16     232   17

Block (6)      (7)          (8)          (9)
Code  T        Code  T      Code  T      Code  T
312   12       113   19     213   20     313   21
322   15       123   22     223   23     323   24
332   18       133   25     233   26     333   27

Rep. III
Block (1)      (2)          (3)          (4)          (5)
Code  T        Code  T      Code  T      Code  T      Code  T
111    1       211    2     311    3     121    4     221    5
112   10       212   11     312   12     122   13     222   14
113   19       213   20     313   21     123   22     223   23

Block (6)      (7)          (8)          (9)
Code  T        Code  T      Code  T      Code  T
321    6       131    7     231    8     331    9
322   15       132   16     232   17     332   18
323   24       133   25     233   26     333   27

* Treatment number.


The original treatment numbers are shown as well as the code num- 
bers. In an experiment it is simplest to record only the code numbers, 
which are required in order to follow the computing instructions. 

Cubic lattices may be expected to be most useful when the number of 
treatments exceeds 100. The designs for k = 3, 4, 5, and 6 lie in the 
range which is also covered by lattice designs. We do not know of any 
investigations of the relative accuracy of lattice and cubic lattice designs 
in these cases. With highly variable experimental material, the use of a 
smaller block might give the cubic lattice some advantage; on the other 
hand the statistical analysis is more laborious owing to the greater number
of block adjustments to be calculated. Like the lattice designs,
cubic lattices cannot be appreciably less accurate than randomized 
blocks which occupy the same set of replications, and can be analyzed 
by the method for randomized blocks. 

Some cubic lattices can also be arranged in quasi-latin squares (10.11) 
so as to allow the elimination of two sources of error variation. 

10.42 Arrangement of Experimental Material. The most important 
rule is to have units in the same incomplete block as homogeneous as the 
experimental material permits; in field trials the blocks should be com- 
pact in shape. If practicable, blocks within the same replicate should 
also be similar to each other, so as to increase the accuracy of inter-block 
comparisons and to obtain more precise results if the analysis is sub- 
sequently carried out by the method for randomized blocks. 

10.43 Randomization. The steps are: 

1. Randomize the order of the blocks independently within each rep- 
lication. 

2. Randomize the positions of the treatment code numbers inde- 
pendently within each block. 

3. Assign treatments to code numbers at random. 

10.44 Statistical Analysis. The following reference should be con- 
sulted. 

(10.12) Yates, F. The recovery of inter-block information in variety 
trials arranged in three-dimensional lattices. Ann. Eugen. 9,
136-150, 1939.

General formulae are given with a numerical example of the analysis for
an experiment with 64 treatments. In following this example, the 
reader should note that the experiment was not arranged in separate 
replications; the changes introduced for this reason are pointed out by 
Yates (pp. 147-148). The method of analysis when 6 or 9 replicates are 
used is also indicated (p. 144). Reference (10.13) describes the analysis 
of an experiment with 125 treatments. 

10.45 Error Variances. The formulae for the error variance of the
difference between two treatment means are summarized below. In the
analysis as described by Yates (10.12), the symbols

$$\lambda = \frac{w - w'}{w + 2w'}, \qquad \mu = \frac{w - w'}{2w + w'}$$

play a prominent part. It is convenient to write λ' = λ/k, μ' = μ/k².
The error variances for the different types of treatment comparisons are
as follows:

Comparison         Variance of difference between means

t_211 - t_111      (2E_e/r) [1 + 2λ' + 2(k - 1)μ']
t_221 - t_111      (2E_e/r) [1 + 4λ' + (3k - 4)μ']
t_222 - t_111      (2E_e/r) [1 + 6λ' + 3(k - 2)μ']
Average            (2E_e/r) [1 + 3k²{2λ' + (k - 1)μ'}/(k² + k + 1)]

Here E_e is the intra-block error m.s. and r the number of replicates
(3, 6, or 9). The formulae, though written in a slightly different form,
are identical with those given by Yates. Our k corresponds to Yates's
p, and our r to his ft. 
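
The three types of comparison correspond to pairs of treatments whose codes differ in one, two, or all three digits. As a check on the arithmetic, the following Python sketch averages the bracketed factors over all treatments other than (111) and compares the result with a closed form. The bracketed factors are taken as 1 + 2λ' + 2(k - 1)μ', 1 + 4λ' + (3k - 4)μ', and 1 + 6λ' + 3(k - 2)μ'; this reading of the formulae, the function names, and the trial values of λ' and μ' are assumptions made for illustration.

```python
from itertools import product


def variance_factors(k, lam_p, mu_p):
    """Bracketed factors for codes differing in 1, 2, or 3 digits (assumed forms)."""
    return {
        1: 1 + 2 * lam_p + 2 * (k - 1) * mu_p,
        2: 1 + 4 * lam_p + (3 * k - 4) * mu_p,
        3: 1 + 6 * lam_p + 3 * (k - 2) * mu_p,
    }


def average_factor(k, lam_p, mu_p):
    """Average factor over all comparisons of treatment (111) with the others."""
    factors = variance_factors(k, lam_p, mu_p)
    reference = (1, 1, 1)
    total, count = 0.0, 0
    for code in product(range(1, k + 1), repeat=3):
        differing = sum(a != b for a, b in zip(code, reference))
        if differing:
            total += factors[differing]
            count += 1
    return total / count


def closed_form(k, lam_p, mu_p):
    """Closed form of the same average, obtained by counting the three kinds of pair."""
    return 1 + 3 * k * k * (2 * lam_p + (k - 1) * mu_p) / (k * k + k + 1)


if __name__ == "__main__":
    # Arbitrary trial values, chosen only to exercise the two functions.
    print(average_factor(4, 0.05, 0.01), closed_form(4, 0.05, 0.01))
```

Multiplying any of these factors by 2E_e/r gives the corresponding variance of a difference between two treatment means.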
Plan 10.1 3 X 3 balanced lattice

Plan 10.2

Plan 10.3 5 X 5 balanced lattice

t = 25, k = 5, r = 6, b = 30, λ = 1 *

Block       Rep. I           Block       Rep. II          Block       Rep. III

 (1)    1   2   3   4   5     (6)    1   6  11  16  21    (11)    1   7  13  19  25
 (2)    6   7   8   9  10     (7)    2   7  12  17  22    (12)   21   2   8  14  20
 (3)   11  12  13  14  15     (8)    3   8  13  18  23    (13)   16  22   3   9  15
 (4)   16  17  18  19  20     (9)    4   9  14  19  24    (14)   11  17  23   4  10
 (5)   21  22  23  24  25    (10)    5  10  15  20  25    (15)    6  12  18  24   5

Block       Rep. IV          Block       Rep. V           Block       Rep. VI

(16)    1  12  23   9  20    (21)    1  17   8  24  15    (26)    1  22  18  14  10
(17)   16   2  13  24  10    (22)   11   2  18   9  25    (27)    6   2  23  19  15
(18)    6  17   3  14  25    (23)   21  12   3  19  10    (28)   11   7   3  24  20
(19)   21   7  18   4  15    (24)    6  22  13   4  20    (29)   16  12   8   4  25
(20)   11  22   8  19   5    (25)   16   7  23  14   5    (30)   21  17  13   9   5

* The symbol λ denotes the number of times that two treatments appear in the
same block.
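
The balance property expressed by λ = 1 can be verified mechanically by counting how often each pair of treatments shares a block. The sketch below does this in Python for the 30 blocks of the 5 X 5 balanced lattice as listed in plan 10.3; the names `PLAN_10_3` and `pair_counts` are illustrative only.

```python
from collections import Counter
from itertools import combinations

# Blocks (1) to (30) of the 5 X 5 balanced lattice, five per replication.
PLAN_10_3 = [
    [1, 2, 3, 4, 5], [6, 7, 8, 9, 10], [11, 12, 13, 14, 15],
    [16, 17, 18, 19, 20], [21, 22, 23, 24, 25],
    [1, 6, 11, 16, 21], [2, 7, 12, 17, 22], [3, 8, 13, 18, 23],
    [4, 9, 14, 19, 24], [5, 10, 15, 20, 25],
    [1, 7, 13, 19, 25], [21, 2, 8, 14, 20], [16, 22, 3, 9, 15],
    [11, 17, 23, 4, 10], [6, 12, 18, 24, 5],
    [1, 12, 23, 9, 20], [16, 2, 13, 24, 10], [6, 17, 3, 14, 25],
    [21, 7, 18, 4, 15], [11, 22, 8, 19, 5],
    [1, 17, 8, 24, 15], [11, 2, 18, 9, 25], [21, 12, 3, 19, 10],
    [6, 22, 13, 4, 20], [16, 7, 23, 14, 5],
    [1, 22, 18, 14, 10], [6, 2, 23, 19, 15], [11, 7, 3, 24, 20],
    [16, 12, 8, 4, 25], [21, 17, 13, 9, 5],
]


def pair_counts(blocks):
    """Count the number of blocks in which each pair of treatments occurs together."""
    counts = Counter()
    for block in blocks:
        for pair in combinations(sorted(block), 2):
            counts[pair] += 1
    return counts


if __name__ == "__main__":
    counts = pair_counts(PLAN_10_3)
    # A balanced design shows a single value of lambda for every pair.
    print(len(counts), "pairs; values of lambda found:", set(counts.values()))
```

The same function can be applied to any plan in this chapter or the next; a design is balanced when the set of pair counts contains a single value.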

Plan 10.4 7 X 7 balanced lattice

t = 49, k = 7, r = 8, b = 56, λ = 1

Plan 10.5 8 X 8 balanced lattice

t = 64, k = 8, r = 9, b = 72, λ = 1

Plan 10.6 9 X 9 balanced lattice

Plan 10.7 6 X 6 triple lattice

Plan 10.9 12 X 12 quadruple lattice

6 X 7 rectangular lattice

Plan 10.16 9 X 10 rectangular lattice

CHAPTER 11


BALANCED INCOMPLETE BLOCKS

11.1 Description

The balanced lattices described in the last chapter are a particular 
group of a general class of designs known as balanced incomplete blocks. 
All these designs have the property that any pair of treatments appears 
together equally often within some block. Thus in plan 11.1 every pair
of treatments appears together once in the same block, in plan 11.2 twice,
and in plan 11.5 four times. This property insures that the same stand-
ard error may be used for comparing every pair of treatments; it also
facilitates the statistical analysis, since any treatment total is adjusted 
in a single operation for all the blocks in which the treatment appears. 

The construction of the designs presents interesting mathematical
problems (11.1), (11.2), (11.3). Although a design can be found for any
number of treatments, t, and any size of block, k, most of these are of no
interest for our purpose, since they require too many replications. The
plans in this chapter have been restricted to those in which the number 
of replicates does not exceed 10. 

The balanced lattices are the particular set of designs for which the
relation t = k² holds. They also have the property that the blocks can
be grouped in separate replications. The majority of the balanced in-
complete block arrangements do not possess this property, which is pos-
sible only when t is a multiple of k. A few designs can be sorted into
groups which comprise two or three replicates each.

In most plans, the block contains six or fewer units. Accordingly, the 
designs are adapted for experiments in which the appropriate size of 
block is small. They have been applied in greenhouse pot experiments, 
where the block is restricted to the width of the bench; in experiments
on plant virus diseases, where the block consists of a small number of
leaves on each plant; in experimental cookery, where there is a limited
number of stoves; in tests of mosquito repellents, reference (11.4), where
the block consists of the two exposed arms of a subject; and in
nutritional experiments where each child constitutes a block, 3 different
foods being given during the 3 terms of the school year.

Examples from field experimental work occur in blueberry fertilizer 
trials where, owing to the uneven growth of different bushes, only a small 
number of them can be grouped into a homogeneous block. A similar 
consideration may limit the size of block in experiments on the control 
of fruit pests, where the tree constitutes the experimental unit. 

A few types of factorial design — the 5 X 2, 5 X 3, 7 X 3, and 7 X 4 — 
can be arranged in balanced incomplete blocks with the result that main 
effects and interactions are confounded to the same extent. It happens 
that in these cases there is no suitable method of confounding the inter- 
actions alone. 


11.2 Comparisons with Other Designs

Much experimental material is of the type exemplified above, in that 
the natural grouping for use as a block accommodates only a small num- 
ber of units. Where the number of treatments exceeds the number of 
units in a block, there are three alternatives to the use of a balanced 
incomplete block design. (1) A common treatment is placed in each 
block, so as to provide a basis for comparison between treatments which 
are in different blocks. (2) The treatments are divided into groups such 
that the number of treatments per group is equal to the number of units 
per block. The groups are compared in a split-plot arrangement (chap- 
ter 7). (3) Complete replicates are assembled by the grouping of units 
from more than one block, in order that a randomized block design may 
be used. The merits of the first two alternatives were discussed in sec- 
tion 9.5. 

The relative efficiency of randomized blocks and balanced incomplete 
blocks has been discussed by Yates (11.5). Balanced incomplete block 
designs which are arranged in replications cannot be appreciably less 
accurate than randomized blocks. With balanced incomplete blocks 
which cannot be arranged in replications, a comparison with randomized 
blocks is not easy except by the use of uniformity data. Certain con- 
clusions can be drawn from the nature of the experimental material. If 
there is some association among the members of different blocks, it may
be possible to form replicates which are only slightly less homogeneous
than the blocks. For example, suppose that in an animal experiment the
litter is the natural block. If the responses of the animals are influenced 
by their genetic constitution and if replications are constructed from 
closely related litters, the experimental error may be only slightly greater 
within a replication than within a block. In this event randomized 
blocks may be the more accurate layout. 

Sometimes the investigator has no information about any associations
between the groups which constitute the blocks; he knows merely that
the experimental responses are likely to differ from group to group. In
these circumstances Yates (11.5) has shown that incomplete blocks (with
the recovery of inter-block information) are more accurate on the aver-
age except when the blocks are ineffective. There is the additional ad-
vantage that the block differences can be measured as a guide to the de-
sign of future experiments, whereas with replications composed of units
from several blocks it would be difficult to estimate the block effects.

It is worth noting the efficiency factor (E) of any balanced incomplete 
block design whose use is under consideration. This factor, which is shown 
as a decimal fraction in table 11.3 (p. 327) and in the plans, is a lower 
limit to the efficiency of balanced incomplete blocks relative to ran- 
domized blocks. With 37 treatments in blocks of 9 units, for instance, 
the efficiency factor is 91%, so that the loss of efficiency could not exceed 
9%. The loss equals 9% only in the rather unlikely situation in which 
the replications of 37 units are as uniform as the blocks of 9 units, yet the 
variation among blocks is large compared to that within blocks. With 
any reasonable prospect that the blocks are more homogeneous than 
the replicates, the incomplete block design is to be preferred in this 
case. On the other hand, with 7 treatments in blocks of 2 units, the 
efficiency factor is only 58%. In this case incomplete blocks will be 
more accurate only if there is a substantial reduction in error variation 
because of the reduction in block size. 
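
The two efficiency factors quoted above can be reproduced from t and k alone. The sketch below uses the standard expression E = t(k - 1)/[k(t - 1)] for a balanced incomplete block design; this expression is not written out in the present section and is introduced here as an assumption, and the function name is illustrative.

```python
def efficiency_factor(t, k):
    """Efficiency factor E of a balanced incomplete block design.

    Uses E = t(k - 1) / [k(t - 1)], the usual expression for these designs
    (assumed here; the section quotes only the resulting values).
    """
    return t * (k - 1) / (k * (t - 1))


if __name__ == "__main__":
    for t, k in [(37, 9), (7, 2), (6, 2), (4, 2)]:
        print(f"t = {t:2d}, k = {k}: E = {efficiency_factor(t, k):.2f}")
```

The quantity 1 - E is the greatest proportional loss of efficiency, relative to randomized blocks, that the design can incur.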


11.3 Arrangement of Experimental Material 

Units within the same incomplete block should be as homogeneous as 
possible. For those designs which can also be grouped in replications, it 
is advisable to place similar blocks, so far as these can be discerned, 
within the same replicate. 

Certain of these designs can be rearranged to form an incomplete latin 
square (chapter 13) in which variation associated with two types of 
grouping can be eliminated from the experimental errors. The experi- 
menter should consider whether advantage can be taken of this double 
control so as to reduce the error. 

11.4 Randomization 

The steps are: 

1. Rearrange the blocks at random. (If the design is arranged in 
complete replications, the blocks are randomized only within each repli- 
cation and the replications are kept separate.) 

2. Randomize the positions of the treatment numbers within each
block.

3. Assign treatments at random to the numbers in the plan. 
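
The three randomization steps can be sketched as follows. This is a minimal illustration, not a prescribed procedure: the function `randomize_plan`, its arguments, and the example plan (the arrangement of 6 treatments in blocks of 2 used in the beef experiment of section 11.51) are assumptions chosen for the example. A design that is not arranged in replications is handled by passing all its blocks as a single group.

```python
import random


def randomize_plan(groups, treatment_names, seed=None):
    """Randomize a plan given as a list of groups (replications) of blocks.

    groups: list of replications, each a list of blocks of plan numbers.
    treatment_names: the actual treatments, one per plan number 1..t.
    Returns the randomized layout with treatment names substituted.
    """
    rng = random.Random(seed)

    # Step 3: assign the treatments at random to the numbers in the plan.
    names = list(treatment_names)
    rng.shuffle(names)
    label = {number: name for number, name in enumerate(names, start=1)}

    layout = []
    for group in groups:
        blocks = [list(block) for block in group]
        rng.shuffle(blocks)            # Step 1: blocks at random within the replication.
        for block in blocks:
            rng.shuffle(block)         # Step 2: positions at random within each block.
        layout.append([[label[n] for n in block] for block in blocks])
    return layout


if __name__ == "__main__":
    plan = [
        [[1, 2], [3, 4], [5, 6]],
        [[1, 3], [2, 5], [4, 6]],
        [[1, 4], [2, 6], [3, 5]],
        [[1, 5], [2, 4], [3, 6]],
        [[1, 6], [2, 3], [4, 5]],
    ]
    for rep in randomize_plan(plan, "ABCDEF", seed=1):
        print(rep)
```

Because the replications are kept separate and only relabelling and reordering take place, the randomized layout retains the balance of the original plan.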

11.5 Statistical Analysis 

In order to present the shortest computational methods, it is ad- 
visable to divide the plans into five types, which are shown in the index 
at the end of this chapter. 

11.51 Type I. Designs Arranged in Replications. We will analyze 
an example with 6 treatments in blocks of 2 (plan 11.3). The experiment 
was conducted by Dr. Pauline Paul at Iowa State College (11.6). Its 
object was to compare the effects of length of cold storage on the tender- 
ness and flavor of beef roasts. Six periods of storage (0, 1, 2, 4, 9, and 
18 days) were tested; these are denoted by treatment symbols 1, 2, 3,
. . . , 6, respectively.

Thirty roasts from the round of an animal were used. Four muscles 
each provided 6 roasts, while 3 muscles each provided 2 roasts. The 
roasts on any muscle group themselves naturally into pairs, since to each 
roast on the left side of an animal there corresponds another roast on 
the right side. From previous experience it was believed that the 2 
roasts in any pair would give closely similar results. Variation among 
different pairs from the same muscle was expected to be somewhat larger, 
and variation among muscles to be still larger. 

These opinions prompted the use of a design in blocks of 2, each block
comprising the left and right roasts in a pair. When grouping the blocks
into replications, it was natural to put roasts from the same muscle into 
the same replicate. With the first 4 muscles a separate replicate could 
be made from each muscle. The remaining replication consisted of the 
3 smaller muscles. 

The plan and the scores for tenderness are given in table 11.1. Scoring
was done by 4 judges, each marking on a scale from 0 to 10. The scores
shown are their totals (out of 40), a high score indicating very tender
beef.

The following symbols are used: t = 6 = number of treatments;
k = 2 = number of units per block; r = 5 = number of replications;
b = 15 = number of blocks.

1. Find the block totals, B, the replicate totals, the grand total, G,
and the treatment totals, T, placed below the plan.

TABLE 11.1 Scores for tenderness of beef

       Rep. I                   Rep. II                  Rep. III
                      B                        B                        B
(1)  7   (2) 17      24  (1) 17   (3) 27      44  (1) 10   (4) 25      35
(3) 26   (4) 25      51  (2) 23   (5) 27      50  (2) 26   (6) 37      63
(5) 33   (6) 29      62  (4) 29   (6) 30      59  (3) 24   (5) 26      50
                    137                      153                      148

       Rep. IV                  Rep. V
                      B                        B
(1) 25   (5) 40      65  (1) 11   (6) 27      38
(2) 25   (4) 34      59  (2) 24   (3) 21      45
(3) 34   (6) 32      66  (4) 26   (5) 32      58
                    190                      141

Storage   Treatment                                          Adjusted
(days)    number        T     B_t      Q      W    T + μW    means

   0         1         70     206    -66     19     71.8      14.4
   1         2        115     241    -11     24    117.3      23.5
   2         3        132     256      8     17    133.6      26.7
   4         4        139     262     16     15    140.4      28.1
   9         5        158     285     31    -24    155.7      31.1
  18         6        155     288     22    -51    150.2      30.0

Totals            G = 769    1538      0      0    769.0


2. For each treatment, calculate the following quantities:

B_t = total of all blocks in which the treatment appears
Q = kT - B_t = 2T - B_t
W = (t - k)T - (t - 1)B_t + (k - 1)G = 4T - 5B_t + G

The total of the Q's and the W's should each be zero.
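
Steps 1 and 2 can be checked by machine. The Python sketch below holds the scores of table 11.1 as one dictionary per block and computes T, B_t, Q, and W for each treatment; the data layout and the names `BEEF` and `treatment_quantities` are illustrative choices, not part of the original computing instructions.

```python
# Tenderness scores: one list per replication, one {treatment: score} dict per block.
BEEF = [
    [{1: 7, 2: 17}, {3: 26, 4: 25}, {5: 33, 6: 29}],
    [{1: 17, 3: 27}, {2: 23, 5: 27}, {4: 29, 6: 30}],
    [{1: 10, 4: 25}, {2: 26, 6: 37}, {3: 24, 5: 26}],
    [{1: 25, 5: 40}, {2: 25, 4: 34}, {3: 34, 6: 32}],
    [{1: 11, 6: 27}, {2: 24, 3: 21}, {4: 26, 5: 32}],
]


def treatment_quantities(data, t, k):
    """Return the grand total G and {treatment: (T, B_t, Q, W)}."""
    blocks = [block for rep in data for block in rep]
    G = sum(sum(block.values()) for block in blocks)
    rows = {}
    for i in range(1, t + 1):
        own = [block for block in blocks if i in block]
        T = sum(block[i] for block in own)               # treatment total
        B_t = sum(sum(block.values()) for block in own)  # totals of its blocks
        Q = k * T - B_t
        W = (t - k) * T - (t - 1) * B_t + (k - 1) * G
        rows[i] = (T, B_t, Q, W)
    return G, rows


if __name__ == "__main__":
    G, rows = treatment_quantities(BEEF, t=6, k=2)
    print("G =", G)
    for i, (T, B_t, Q, W) in rows.items():
        print(i, T, B_t, Q, W)
```

The zero sums of the Q and W columns give a quick check on both the block totals and the treatment totals.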

3. The analysis of variance is as follows:

                                                  d.f.     s.s.      m.s.

Replications                 (r - 1)                4      298.5
Treatments (unadj.)          (t - 1)                5     1059.8
Blocks within reps. (adj.)   (b - r)               10      213.4    21.34 = E_b
Intra-block error            (tr - t - b + 1)      10       77.3     7.73 = E_e

Total                        (tr - 1)              29     1649.0

The only sum of squares requiring special instructions is that for blocks
within replications, adjusted for treatment effects. Since we do not
know any simple way of finding this directly, it is obtained by a round-
about method. Calculate in the usual way the unadjusted sum of squares
for blocks within replications: this comes to 753.0. Then compute the
sum of squares for treatments, adjusted for blocks, which is

$$\frac{(t - 1)\sum Q^2}{ktr(k - 1)} = \frac{5}{(2)(6)(5)(1)}\left[(66)^2 + (11)^2 + \cdots + (22)^2\right] = 520.2$$

The adjusted sum of squares for blocks is then given by the following
combination of sums of squares:

Blocks (unadj.) + treatments (adj.) - treatments (unadj.)

753.0 + 520.2 - 1059.8 = 213.4


4. Calculate the weighting factor

$$\mu = \frac{r(E_b - E_e)}{rt(k - 1)E_b + k(b - r - t + 1)E_e} = \frac{5(21.34 - 7.73)}{(30)(21.34) + (10)(7.73)} = 0.09484$$

The adjusted treatment totals are

T + μW

and the means per unit are found on division by r.

5. The effective error variance per unit is estimated as

E_e[1 + (t - k)μ] = 7.73[1 + (4)(0.0948)] = 10.66

We derive t-tests from this figure by the ordinary rules. For instance,
the standard error of the difference between two adjusted totals is
√(2r(10.66)), or 10.32; for the difference between two adjusted means the
standard error is √((2)(10.66)/r), or 2.06.
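
Steps 4 and 5 amount to a few lines of arithmetic, sketched below in Python for the beef experiment. The function name and return layout are illustrative; the formulae are those given in the two steps. Because the sketch carries μ and the effective error variance without intermediate rounding, the standard errors may differ in the last printed digit from the hand-computed 10.32 and 2.06.

```python
from math import sqrt


def type_one_summary(t, k, r, b, E_b, E_e):
    """Weighting factor, effective error variance, and standard errors (Type I)."""
    mu = r * (E_b - E_e) / (r * t * (k - 1) * E_b + k * (b - r - t + 1) * E_e)
    effective = E_e * (1 + (t - k) * mu)      # effective error variance per unit
    se_totals = sqrt(2 * r * effective)       # difference of two adjusted totals
    se_means = sqrt(2 * effective / r)        # difference of two adjusted means
    return mu, effective, se_totals, se_means


if __name__ == "__main__":
    mu, effective, se_totals, se_means = type_one_summary(
        t=6, k=2, r=5, b=15, E_b=21.34, E_e=7.73
    )
    print(f"mu = {mu:.5f}, effective error m.s. = {effective:.2f}")
    print(f"s.e. (totals) = {se_totals:.2f}, s.e. (means) = {se_means:.2f}")
```

If E_b should turn out smaller than E_e, the formula yields a negative μ; how that case is to be handled is not covered by this sketch.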

6. If an F-test is desired, calculate the sum of squares of the adjusted
treatment totals. This figure, 4718.0, is divided by r(t - 1), or in this
case 25, in order to find the mean square on a unit basis. The mean
square, 188.7, is compared with the effective error m.s., 10.66, which is
assigned (tr - t - b + 1), or 10, d.f. The F-ratio, 188.7/10.66, is highly
significant. The test is not exact, because it ignores the effects of errors
in the weighting factor.

In an experiment of this type, interest centers on the shape of the
response curve rather than on comparisons between pairs of treatments.
The adjusted means indicate the common "diminishing returns" curve.
An exponential of the form y = A - Be^(-ct), where t is the time of stor-
age, fits the data satisfactorily. A parabolic regression on t, which is
easier to compute, will be found to be inadequate. The conclusion is
that storage up to about a week increases tenderness.

7. The gain in precision over randomized blocks is estimated as fol-
lows. By pooling the sums of squares for blocks within replications and
intra-block error, we obtain an estimate of the error m.s. that would have
applied with randomized blocks. This figure, 14.54, is compared with
the effective error m.s., 10.66. The relative precision is estimated as
14.54/10.66, or 136%. Since the randomized block design has 20
error d.f. as against 10 for the design actually used, this estimate should
be reduced to 126% by the adjustment described in section 2.31.
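
The arithmetic of step 7 is sketched below. Section 2.31 is not reproduced here, so the adjustment for differing error degrees of freedom is taken to be the factor (n1 + 1)(n2 + 3)/[(n2 + 1)(n1 + 3)], with n1 the error d.f. of the design used and n2 those of randomized blocks; this is an assumption, adopted because it reproduces the 126% quoted in the text. The function name is illustrative.

```python
def relative_precision(ss_blocks_adj, df_blocks, ss_intra, df_intra, effective_error):
    """Estimated precision of the incomplete block design relative to randomized blocks."""
    pooled_df = df_blocks + df_intra
    # Error m.s. that would have applied with randomized blocks.
    rb_error = (ss_blocks_adj + ss_intra) / pooled_df
    raw = rb_error / effective_error
    # Assumed form of the section 2.31 adjustment for unequal error d.f.
    n1, n2 = df_intra, pooled_df
    adjusted = raw * (n1 + 1) * (n2 + 3) / ((n2 + 1) * (n1 + 3))
    return rb_error, raw, adjusted


if __name__ == "__main__":
    rb_error, raw, adjusted = relative_precision(213.4, 10, 77.3, 10, 10.66)
    print(f"randomized blocks error m.s. = {rb_error:.2f}")
    print(f"relative precision = {100 * raw:.0f}%, adjusted = {100 * adjusted:.0f}%")
```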

11.52 Type II. Designs Arranged in Groups of Replications. Only
two of the plans are of this type, plans 11.6 and 11.13, and in each case
the group consists of two replications. (Plan 11.2 is also in this class, but
is better analyzed as shown in section 11.55.) The analysis is very sim-
ilar to that in the previous section. Two changes should be noted. If
c is the number of groups, so that c = r/2 for the plans presented here,
the degrees of freedom in the analysis of variance subdivide as shown
below.

                                 d.f.                  m.s.

Groups                           (c - 1)
Treatments (unadj.)              (t - 1)
Blocks within groups (adj.)      (b - c)               E_b
Intra-block error                (tr - t - b + 1)      E_e

Total                            (tr - 1)

The second change is that the weighting factor μ becomes

$$\mu = \frac{(b - c)(E_b - E_e)}{t(k - 1)(b - c)E_b + (t - k)(b - c - t + 1)E_e}$$

11.53 Type III. Designs Not Arranged in Replications or Groups of
Replications. Here again the method of section 11.51 may be followed
except that no replication effects appear. The analysis of variance now
reads as follows.

                          d.f.                  m.s.

Treatments (unadj.)       (t - 1)
Blocks (adj.)             (b - 1)               E_b
Intra-block error         (tr - t - b + 1)      E_e

Total                     (tr - 1)

As before, the sum of squares for blocks (adjusted) is found by a sub-
traction process. The weighting factor becomes

$$\mu = \frac{(b - 1)(E_b - E_e)}{t(k - 1)(b - 1)E_b + (t - k)(b - t)E_e}$$

Otherwise the method of section 11.51 applies. Of course, the gain in
accuracy relative to randomized blocks cannot be estimated. A worked
example with t = 9, k = 4, has been given by Yates (11.5, 11.7). His
method of calculating the adjusted sum of squares for blocks is slightly
different from ours, since he divides this sum of squares into two com-
ponents so that the calculation becomes partially self-checking. Results
should be identical by either method.
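
The weighting factors for the different types of design can be viewed as one expression in which c denotes the number of groups into which the blocks are sorted: c = r for designs arranged in replications, c = r/2 for the plans in groups of two replications, and c = 1 when there is no grouping. The sketch below states this unified form as an assumption; it is not given in this form in the text, but it reduces algebraically to the expression of section 11.51 when c = r and b = tr/k. The function name is illustrative.

```python
def weighting_factor(t, k, b, c, E_b, E_e):
    """Weighting factor mu for a balanced incomplete block design.

    c is the number of groups of blocks: c = r for separate replications,
    c = 1 when the blocks cannot be grouped (assumed unified form).
    """
    numerator = (b - c) * (E_b - E_e)
    denominator = t * (k - 1) * (b - c) * E_b + (t - k) * (b - c - t + 1) * E_e
    return numerator / denominator


if __name__ == "__main__":
    # Beef experiment of section 11.51: t = 6, k = 2, b = 15, five replications.
    print(round(weighting_factor(6, 2, 15, 5, 21.34, 7.73), 5))
    # A design with t = b = 13, k = 4 and no grouping (mean squares of section 11.54).
    print(round(weighting_factor(13, 4, 13, 1, 39.61, 19.93), 4))
```

When t = b and c = 1 the second term of the denominator vanishes, which is why the factor takes a simpler form for such designs.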

11.54 Type IV. Experiments with t = b. These designs, which can-
not be arranged in replications, are covered by the computing instruc-
tions given in the previous section. However, the analysis simplifies
slightly when t = b, because the sum of squares for blocks (adjusted)
can be found directly. The data in table 11.2 are from an experiment on
corn hybrids with t = b = 13, r = k = 4. For field experiments a de-
sign that can be laid out in separate replications is usually preferred.
In this experiment, conducted in 1943 by the North Carolina Agricultural
Experiment Station, it was desired to compare 10 hybrids, all potentially
suitable for commercial use, at a number of places throughout the state,
and at each place to test in addition 3 standard varieties that were
adapted to that place. With 13 treatments no design that can be put into
replications is available, and since 4 was a convenient number of replica-
tions, the design with t = 13, r = k = 4 was chosen. Randomized
blocks might have been equally good. Incidentally, the plan used
is different from that given in plan 11.22, a different method of con-
struction having been employed. Both plans have the same structural
properties.

1. Find the block totals, B, the treatment totals, T, and for each
treatment find the total B_t of all blocks in which the treatment appears.
Time is saved if T and B_t are found simultaneously. The B_t values should
add to k times the grand total G.

2. For each treatment find

W = (t - k)T - (t - 1)B_t + (k - 1)G
  = 9T - 12B_t + 3G (in this example)

The W's should add to zero.

TABLE 11.2 Plan and yields of corn (pounds per plot)

Block                                                          Block totals

  1      (3) 25.3     (6) 19.9     (9) 29.0    (11) 24.6           98.8
  2      (3) 23.0     (4) 19.8     (8) 33.3    (12) 22.7           98.8
  3     (10) 16.2    (11) 19.3    (12) 31.7    (13) 26.6           93.8
  4      (2) 27.3     (5) 27.0     (8) 35.6    (11) 17.4          107.3
  5      (7) 23.4     (8) 30.5     (9) 30.8    (10) 32.4          117.1
  6      (4) 30.6     (5) 32.4     (6) 27.2    (10) 32.8          123.0
  7      (1) 34.7     (5) 31.1     (9) 25.7    (12) 30.5          122.0
  8      (3) 34.4     (5) 32.4     (7) 33.3    (13) 36.9          137.0
  9      (1) 38.2     (2) 32.9     (3) 37.3    (10) 31.3          139.7
 10      (2) 28.7     (4) 30.7     (9) 26.9    (13) 35.3          121.6
 11      (1) 36.6     (4) 31.1     (7) 31.1    (11) 28.4          127.2
 12      (1) 31.8     (6) 33.7     (8) 27.8    (13) 41.1          134.4
 13      (2) 30.3     (6) 31.5     (7) 39.3    (12) 26.7          127.8

                                                                 1548.5

No.        T        B_t         W      T + μW

  1      141.3     523.3     -362.4     136.7
  2      119.2     496.4     -238.5     116.2
  3      120.0     474.3       33.9     120.4
  4      112.2     470.6        8.1     112.3
  5      122.9     489.3     -120.0     121.4
  6      112.3     484.0     -151.8     110.4
  7      127.1     509.1     -319.8     123.0
  8      127.2     457.6      299.1     131.0
  9      112.4     459.5      143.1     114.2
 10      112.7     473.6      -23.4     112.4
 11       89.7     427.1      327.6      93.9
 12      111.6     442.4      341.1     115.9
 13      139.9     486.8       63.0     140.7

        1548.5    6194.0        0      1548.5


3. The analysis of variance is:

                                           d.f.      s.s.       m.s.

Treatments (unadj.)    (t - 1)              12      542.67
Blocks (adj.)          (b - 1)              12      475.27     39.61 = E_b
Intra-block error      (tr - 2t + 1)        27      538.21     19.93 = E_e

Total                  (tr - 1)             51     1556.15

The sum of squares for blocks (adjusted) is found directly as the sum
of squares of the W's, divided by tr(t - k)(k - 1), or in this case
(4)(13)(9)(3) = 1404.
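
The direct calculation of the adjusted sum of squares for blocks can be reproduced from the plot yields. In the sketch below the yields are entered block by block; the yields of treatment (11) in block 1 and treatment (13) in block 3 are taken as 24.6 and 26.6, the values implied by the printed block and treatment totals. The data layout and function names are illustrative.

```python
# Corn yields (pounds per plot): one {treatment: yield} dict per block.
CORN = [
    {3: 25.3, 6: 19.9, 9: 29.0, 11: 24.6},
    {3: 23.0, 4: 19.8, 8: 33.3, 12: 22.7},
    {10: 16.2, 11: 19.3, 12: 31.7, 13: 26.6},
    {2: 27.3, 5: 27.0, 8: 35.6, 11: 17.4},
    {7: 23.4, 8: 30.5, 9: 30.8, 10: 32.4},
    {4: 30.6, 5: 32.4, 6: 27.2, 10: 32.8},
    {1: 34.7, 5: 31.1, 9: 25.7, 12: 30.5},
    {3: 34.4, 5: 32.4, 7: 33.3, 13: 36.9},
    {1: 38.2, 2: 32.9, 3: 37.3, 10: 31.3},
    {2: 28.7, 4: 30.7, 9: 26.9, 13: 35.3},
    {1: 36.6, 4: 31.1, 7: 31.1, 11: 28.4},
    {1: 31.8, 6: 33.7, 8: 27.8, 13: 41.1},
    {2: 30.3, 6: 31.5, 7: 39.3, 12: 26.7},
]


def w_values(blocks, t, k):
    """Return the grand total G and {treatment: W} for a design with t = b."""
    G = sum(sum(block.values()) for block in blocks)
    W = {}
    for i in range(1, t + 1):
        own = [block for block in blocks if i in block]
        T = sum(block[i] for block in own)
        B_t = sum(sum(block.values()) for block in own)
        W[i] = (t - k) * T - (t - 1) * B_t + (k - 1) * G
    return G, W


def blocks_adjusted_ss(W, t, k, r):
    """Sum of squares for blocks (adjusted), found directly from the W's."""
    return sum(w * w for w in W.values()) / (t * r * (t - k) * (k - 1))


if __name__ == "__main__":
    G, W = w_values(CORN, t=13, k=4)
    ss = blocks_adjusted_ss(W, t=13, k=4, r=4)
    print(f"G = {G:.1f}, blocks (adj.) s.s. = {ss:.2f}, E_b = {ss / 12:.2f}")
```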

4. The weighting factor μ simplifies to

$$\mu = \frac{E_b - E_e}{t(k - 1)E_b} = \frac{19.68}{(13)(3)(39.61)} = 0.0127$$

The adjusted treatment totals are the quantities (T + μW).

5. As before, the effective error m.s. per unit is taken as

E_e[1 + (t - k)μ] = 19.93[1 + (9)(0.0127)] = 22.2

Other features of the analysis are the same as in previous sections. Yates
(11.5) presents an example with t = 21, k = 5.


11.55 Type V. Small Experiments. In certain of the smaller de-
signs, the numbers of degrees of freedom in the mean squares for blocks
and intra-block error are rather small. Consequently, the estimates of
the relative weights assigned to inter- and intra-block comparisons are
poor. In these cases, unless the whole plan has been repeated to secure
extra replication, it is best to use the original method of analysis that
Yates developed (11.8). This method adjusts the treatment means for
differences between the blocks, but makes no use of inter-block informa-
tion. The analysis is a little simpler than when inter-block information
is used.

For purposes of illustration, this analysis will be applied to the ex-
ample in the previous section, with t = b = 13, r = k = 4, though in
practice inter-block information should be utilized in an experiment of
this size. The data are presented in table 11.2.

1. Find the block totals, B, and the treatment totals, T. For each
treatment find the total B_t over all blocks in which the treatment appears.

2. For each treatment compute

Q = kT - B_t = 4T - B_t

$$t' = m + \frac{t - 1}{tr(k - 1)}\,Q = m + \frac{12}{(13)(4)(3)}\,Q = m + 0.0769Q$$

where m = 1548.5/52 = 29.78 is the mean of the whole experiment.
The t' values are the adjusted treatment means. The Q values should
add to zero.

3. The analysis of variance is shown below.

                                           d.f.      s.s.       m.s.

Blocks (unadj.)        (b - 1)              12      689.38
Treatments (adj.)      (t - 1)              12      328.55     27.38
Intra-block error      (tr - t - b + 1)     27      538.22     19.93 = E_e

Total                  (tr - 1)             51     1556.15

Note that the blocks sum of squares is unadjusted. It is found in the
usual way as the sum of squares of deviations of the B's, divided by k.





This rule applies even when the experiment is arranged in replications
or groups of replications, though if the experimenter wishes, he may di-
vide the blocks sum of squares into that due to replications (or groups)
and that due to blocks within replications.

The adjusted sum of squares for treatments is

$$\frac{(t - 1)\sum Q^2}{ktr(k - 1)} = \frac{12}{(4)(13)(4)(3)}\left[(41.9)^2 + (19.6)^2 + \cdots\right] = 328.55$$

As usual, the intra-block error s.s. is found by subtraction.

4. For t-tests, the effective error variance is

$$E_e\left[1 + \frac{t - k}{t(k - 1)}\right] = E_e\,\frac{k(t - 1)}{t(k - 1)} = \frac{(19.93)(12)(4)}{(13)(3)} = 24.5$$

For the variance of the difference between two adjusted treatment
means, we multiply this quantity by 2/r.

5. The analysis above provides an exact F-test of the adjusted treat-
ment means. The F-ratio is 27.38/19.93, with 12 and 27 d.f.
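
The intra-block analysis can be sketched in Python starting from the treatment totals T and the totals B_t of table 11.2. The function name and return layout are illustrative; the formulae are those of steps 2 to 4, and the intra-block error mean square 19.93 is taken from the analysis of variance rather than recomputed.

```python
T = [141.3, 119.2, 120.0, 112.2, 122.9, 112.3, 127.1,
     127.2, 112.4, 112.7, 89.7, 111.6, 139.9]
B_T = [523.3, 496.4, 474.3, 470.6, 489.3, 484.0, 509.1,
       457.6, 459.5, 473.6, 427.1, 442.4, 486.8]


def intra_block_analysis(T, B_t, t, k, r, E_e):
    """Q values, adjusted means, adjusted treatment s.s., effective error variance."""
    m = sum(T) / (t * r)                                  # mean of the whole experiment
    Q = [k * total - bt for total, bt in zip(T, B_t)]
    adjusted_means = [m + (t - 1) * q / (t * r * (k - 1)) for q in Q]
    ss_treatments = (t - 1) * sum(q * q for q in Q) / (k * t * r * (k - 1))
    effective = E_e * k * (t - 1) / (t * (k - 1))
    return Q, adjusted_means, ss_treatments, effective


if __name__ == "__main__":
    Q, means, ss, effective = intra_block_analysis(T, B_T, t=13, k=4, r=4, E_e=19.93)
    print(f"treatments (adj.) s.s. = {ss:.2f}, m.s. = {ss / 12:.2f}")
    print(f"effective error variance = {effective:.1f}")
    print("adjusted means:", [round(x, 1) for x in means])
```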


11.56 Missing Data. For reasons given in section 10.13, it is recom-
mended that the analysis of incomplete data be carried out by inserting
for the missing observations values which minimize the intra-block
error s.s. In our notation the formula, first given by Cornish (11.9), is

$$x = \frac{tr(k - 1)B + k(t - 1)Q - (t - 1)Q'}{(k - 1)\left[tr(k - 1) - k(t - 1)\right]}$$

where B is the total of the block containing the missing observation, Q is
the Q value (= kT - B_t) for the treatment that contains the missing
value, and Q' is the sum of the Q values for all treatments that are in the
block with the missing value.

Example 1. In the experiment on roast beef in section 11.51, suppose
that treatment (5) in replication IV is absent. The first step is to find
the block totals, the treatment totals, the B_t and the Q values as in the
ordinary analysis. An x should be inserted for the missing observation.
The data read as follows.

Treatment
number         T           B_t          Q = 2T - B_t

   1           70        166 + x         -26 - x
   2          115        241             -11
   3          132        256               8
   4          139        262              16
   5        118 + x      245 + x          -9 + x
   6          155        288              22

Totals      729 + x     1458 + 2x          0

Since the block with the missing value contains treatments (1) and (5),

Q' = -26 - 9 = -35

For treatment (5), which contains the missing value, Q is -9. Hence,
since B is 25,

$$x = \frac{30B + 10Q - 5Q'}{20} = \frac{6B + 2Q - Q'}{4} = \frac{(6)(25) + (2)(-9) - (-35)}{4} = 41.75$$

On substituting this value in all places where an x occurs in the table
above, we are ready to proceed with the analysis of variance.
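
The missing-value formula is easily put into a small function. The sketch below takes the formula in the form x = [tr(k - 1)B + k(t - 1)Q - (t - 1)Q'] / {(k - 1)[tr(k - 1) - k(t - 1)]} and applies it to the roast beef example with B = 25, Q' = -35, and Q = -9 for the treatment with the missing value (the part of that treatment's Q not involving x). The function and argument names are illustrative.

```python
def missing_value(t, k, r, B, Q, Q_prime):
    """Value to insert for one missing observation in a balanced incomplete block design.

    B: total of the observed values in the block with the missing unit.
    Q: kT - B_t for the treatment with the missing value, computed without x.
    Q_prime: sum of the Q values of all treatments in that block.
    """
    numerator = t * r * (k - 1) * B + k * (t - 1) * Q - (t - 1) * Q_prime
    denominator = (k - 1) * (t * r * (k - 1) - k * (t - 1))
    return numerator / denominator


if __name__ == "__main__":
    # Roast beef example: t = 6, k = 2, r = 5.
    print(missing_value(t=6, k=2, r=5, B=25, Q=-9, Q_prime=-35))
```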

Example 2. For an example with 2 missing observations, suppose
that treatments (8) and (13) are missing from block 12 in the corn
experiment in section 11.54. Denote the values to be inserted by x and y,
respectively. To save space, the preliminary data are presented only
for the treatments that occur in the same block as the missing
observations; these are treatments (1), (6), (8), and (13).

Treatment
number          T              B_t              Q = kT − B_t
   1          141.3       454.4 + x + y        110.8 − x − y
   6          112.3       415.1 + x + y         34.1 − x − y
   8         99.4 + x     388.7 + x + y          8.9 + 3x − y
  13         98.8 + y     417.9 + x + y        −22.7 − x + 3y

                                               131.1 = Q'

B = 65.5 + x + y

In the 3 replications in which it appears, the average for treatment (13)
is 33: this will be chosen as the first approximation to y. Then for x we
have

B = 65.5 + 33.0 = 98.5;  Q = 8.9 − 33 = −24.1;  Q' = 131.1

Since t = 13, k = r = 4,

$$x = \frac{156B + 48Q - 12Q'}{324} = \frac{13B + 4Q - Q'}{27} = \frac{(13)(98.5) + (4)(-24.1) - (131.1)}{27} = 39.0$$

We now substitute x = 39.0 in the preliminary data and obtain a
second approximation to y. For this we have

B = 65.5 + 39.0 = 104.5;  Q = −22.7 − 39.0 = −61.7;  Q' = 131.1

Note that Q', which does not involve x or y, remains fixed throughout the
calculation. The second approximation to y is

$$y = \frac{(13)(104.5) + (4)(-61.7) - (131.1)}{27} = 36.3$$

The next 2 approximations will be found to be x = 40.1, y = 36.7,
which may be used as the values to be inserted.
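The alternating calculation of Example 2 can be sketched in a few lines of Python. This is only an illustration: the function names `cornish_estimate` and `iterate_two_missing` are hypothetical, the constants 65.5, 8.9, −22.7, and 131.1 are the preliminary values quoted above, and each sweep passes on the unrounded estimate, whereas the hand calculation rounds to one decimal between steps.

```python
def cornish_estimate(t, k, r, B, Q, Q_prime):
    """Missing-value estimate for a balanced incomplete block design.

    B is the total of the block with the gap, Q is kT - B_t for the
    treatment with the gap, and Q_prime is the sum of the Q values of
    all treatments in that block.
    """
    numerator = t * r * (k - 1) * B + k * (t - 1) * Q - (t - 1) * Q_prime
    denominator = (k - 1) * (t * r * (k - 1) - k * (t - 1))
    return numerator / denominator


def iterate_two_missing(y_start, sweeps=2):
    """Alternate between x and y for Example 2 (t = 13, k = r = 4)."""
    t, k, r = 13, 4, 4
    q_prime = 131.1  # does not involve x or y, so it stays fixed
    y = y_start
    history = []
    for _ in range(sweeps):
        # Estimate x, treating the current y as an observed value.
        x = cornish_estimate(t, k, r, 65.5 + y, 8.9 - y, q_prime)
        # Estimate y, treating the new x as an observed value.
        y = cornish_estimate(t, k, r, 65.5 + x, -22.7 - x, q_prime)
        history.append((round(x, 1), round(y, 1)))
    return history


print(iterate_two_missing(33.0))  # [(39.0, 36.3), (40.1, 36.7)]
```

Starting from y = 33.0, two sweeps reproduce the successive approximations given in the example.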

The rest of the analysis proceeds as usual; the intra-block error
degrees of freedom are reduced by 1 for each missing observation. In the
calculation of standard errors for t-tests of the adjusted treatment
means, the method given in section 16.13 should be followed.
TABLE 11.3  Index to plans

  t     k     r     b     λ†     E      Plan     Type
  4     2     3     6     1     .67     11.1      V
        3     3     4     2     .89      *        V
  5     2     4    10     1     .62     11.2      V
        3     6    10     3     .83      *       III
        4     4     5     3     .94      *        V
  6     2     5    15     1     .60     11.3      I
        3     5    10     2     .80     11.4     III
        3    10    20     4     .80     11.5      I
        4    10    15     6     .90     11.6     II
        5     5     6     4     .96      *        V
  7     2     6    21     1     .58      *       III
        3     3     7     1     .78     11.7      V
        4     4     7     2     .88     11.8      V
        6     6     7     5     .97      *        V
  8     2     7    28     1     .57     11.9      I
        4     7    14     3     .86     11.10     I
        7     7     8     6     .98      *        V
  9     2     8    36     1     .56      *       III
        4     8    18     3     .84     11.11    III
        5    10    18     5     .90     11.12    III
        6     8    12     5     .94     11.13    II
        8     8     9     7     .98      *       IV
 10     2     9    45     1     .56     11.14     I
        3     9    30     2     .74     11.15    III
        4     6    15     2     .83     11.16    III
        5     9    18     4     .89     11.17    III
        6     9    15     5     .93     11.18    III
        9     9    10     8     .99      *       IV
 11     2    10    55     1     .55      *       III
        5     5    11     2     .88     11.19    IV
        6     6    11     3     .92     11.20    IV
       10    10    11     9     .99      *       IV
 13     3     6    26     1     .72     11.21    III
        4     4    13     1     .81     11.22    IV
        9     9    13     6     .96     11.23    IV
 15     3     7    35     1     .71     11.24     I
        7     7    15     3     .92     11.25    IV
        8     8    15     4     .94     11.26    IV
 16     6     6    16     2     .89     11.27    IV
        6     9    24     3     .89     11.28    III
       10    10    16     6     .96     11.29    IV
 19     3     9    57     1     .70     11.30    III
        9     9    19     4     .94     11.31    IV
       10    10    19     5     .95     11.32    IV
 21     3    10    70     1     .70     11.33     I
        5     5    21     1     .84     11.34    IV
        7    10    30     3     .90     11.35    III
 25     4     8    50     1     .78     11.36    III
        9     9    25     3     .93     11.37    IV
 28     4     9    63     1     .78     11.38     I
        7     9    36     2     .89     11.39    III
 31     6     6    31     1     .86     11.40    IV
       10    10    31     3     .93     11.41    IV
 37     9     9    37     2     .91     11.42    IV
 41     5    10    82     1     .82     11.43    III
 57     8     8    57     1     .89     11.44    IV
 73     9     9    73     1     .90     11.45    IV
 91    10    10    91     1     .91     11.46    IV

* These plans are constructed by forming all possible combinations of the t numbers
in groups of size k. The number of blocks b serves as a check that no group has been
missed.

† Number of times that two treatments appear together in the same block.
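The construction described in the first footnote (all combinations of the t numbers in groups of size k) is easily mechanized. The following sketch is illustrative only; the function name `unreduced_plan` is hypothetical. It builds such a plan for t = 5, k = 3 and counts the blocks, the replications, and the number of times each pair of treatments occurs together, for comparison with the index entry b = 10, r = 6, λ = 3.

```python
from collections import Counter
from itertools import combinations


def unreduced_plan(t, k):
    """All combinations of treatments 1..t taken k at a time, as blocks."""
    blocks = list(combinations(range(1, t + 1), k))
    # How often each treatment occurs (r) and each pair occurs together (lambda).
    replications = Counter(trt for block in blocks for trt in block)
    pair_counts = Counter(pair for block in blocks
                          for pair in combinations(block, 2))
    return blocks, replications, pair_counts


blocks, replications, pair_counts = unreduced_plan(5, 3)
print(len(blocks))                  # number of blocks b
print(set(replications.values()))   # a single value r if equally replicated
print(set(pair_counts.values()))    # a single value lambda if balanced
```

The number of blocks serves as the check mentioned in the footnote; a single value in each of the two sets shows that the plan is equally replicated and balanced.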



PLANS 



Plan 11.1   t = 4, k = 2, r = 3, b = 6, λ = 1, E = 0.67, Type V

Block   Rep. I          Rep. II          Rep. III
 (1)    1  2      (3)   1  3      (5)    1  4
 (2)    3  4      (4)   2  4      (6)    2  3


Plan 11.2   t = 5, k = 2, r = 4, b = 10, λ = 1, E = 0.62, Type V

         Reps. I                Reps. III
Block    and II                 and IV
 (1)     1  2           (6)     1  4
 (2)     3  4           (7)     2  3
 (3)     2  5           (8)     3  5
 (4)     1  3           (9)     1  5
 (5)     4  5          (10)     2  4


Plan 11.3   t = 6, k = 2, r = 5, b = 15, λ = 1, E = 0.60, Type I

Block  Rep. I        Rep. II        Rep. III        Rep. IV         Rep. V
 (1)   1  2    (4)   1  3    (7)    1  4    (10)    1  5    (13)    1  6
 (2)   3  4    (5)   2  5    (8)    2  6    (11)    2  4    (14)    2  3
 (3)   5  6    (6)   4  6    (9)    3  5    (12)    3  6    (15)    4  5


Plan 11.4   t = 6, k = 3, r = 5, b = 10, λ = 2, E = 0.80, Type III

Block
 (1)   1  2  5        (6)   2  3  4
 (2)   1  2  6        (7)   2  3  5
 (3)   1  3  4        (8)   2  4  6
 (4)   1  3  6        (9)   3  5  6
 (5)   1  4  5       (10)   4  5  6


Plan 11.5   t = 6, k = 3, r = 10, b = 20, λ = 4, E = 0.80, Type I

Block   Rep. I            Rep. II            Rep. III           Rep. IV
 (1)    1  2  3     (3)   1  2  4     (5)    1  2  5     (7)    1  2  6
 (2)    4  5  6     (4)   3  5  6     (6)    3  4  6     (8)    3  4  5

        Rep. V            Rep. VI            Rep. VII           Rep. VIII
 (9)    1  3  4    (11)   1  3  5    (13)    1  3  6    (15)    1  4  5
(10)    2  5  6    (12)   2  4  6    (14)    2  4  5    (16)    2  3  6

        Rep. IX           Rep. X
(17)    1  4  6    (19)   1  5  6
(18)    2  3  5    (20)   2  3  4
Plan 11.6   t = 6, k = 4, r = 10, b = 15, λ = 6, E = 0.90, Type II

Block   Reps. I and II         Reps. III and IV        Reps. V and VI
 (1)    1  2  3  4       (4)   1  2  3  5        (7)   1  2  3  6
 (2)    1  4  5  6       (5)   1  2  4  6        (8)   1  3  4  5
 (3)    2  3  5  6       (6)   3  4  5  6        (9)   2  4  5  6

        Reps. VII and VIII     Reps. IX and X
(10)    1  2  4  5      (13)   1  2  5  6
(11)    1  3  5  6      (14)   1  3  4  6
(12)    2  3  4  6      (15)   2  3  4  5





Plan 11.7   t = 7, k = 3, r = 3, b = 7, λ = 1, E = 0.78, Type V

Block
 (1)   1  2  4      (3)   3  4  6      (5)   5  6  1      (7)   7  1  3
 (2)
Plan 11.46   t = 91, k = 10, r = 10, b = 91, λ = 1, E = 0.91, Type IV
CHAPTER 12


LATTICE SQUARES 
12.1 Description 

12.11 Balanced Lattice Squares. The number of treatments must be
an exact square. Within each replicate, the k² treatments are arranged
on the plan in a k × k square. The method of grouping into rows and
columns, which varies in successive replications, is such that the
treatment means can be adjusted for differences among the rows and columns
of each square. Thus, in addition to the elimination of differences among
replicates from the experimental errors, the design permits a "double
control" within each replicate, similar to that obtained in a latin square.

The principle which governs the grouping into rows and columns may 
be seen from the plans 12.1-12.8. With 9 treatments, for instance, treat- 
ment 1 occurs in the same row as treatments 2, 3, 6, and 8, and in the 
same column as treatments 4, 5, 7, and 9. Thus every other treatment 
appears with treatment 1 either in the same row or in the same column. 
More generally, any pair of treatments occurs once in the same row or in 
the same column. 

This property holds for all plans having an odd number of treatments,
i.e., for 9, 25, 49, 81, 121, and 169 treatments. The number of replicates,
for k² treatments, is (k + 1)/2. The standard error of the difference is
not exactly the same for all pairs of treatments, though the variation in
accuracy is small. Separate formulae are given for the two standard
errors for cases in which they may be needed.

A design with twice the basic number of replicates (e.g., 25 treatments
in 6 replicates) is obtained by repeating the plan, the rows being
interchanged with the columns. In designs so produced, every pair of
treatments occurs together once in the same row and once in the same column.
On account of this property, the standard error of the difference is the
same for all pairs of treatments, whether the row and column differences
are large or small.

Within the most useful range, the only even numbers of treatments 
which provide lattice square designs are 16 and 64. These designs have, 
respectively, 5 and 9, i.e. (k + 1) replicates. Every pair of treatments 
occurs once both in the same row and in the same column. 

The available selection of designs (up to 12 replicates) is summarized
in table 12.1. By analogy with balanced lattices, the designs may be
called balanced lattice squares.

TABLE 12.1  Available numbers of treatments and replications for
balanced lattice squares

Number of treatments    9      16      25     49     64    81      121     169
Number of replicates    4, 8   5, 10   3, 6   4, 8   9     5, 10   6, 12   7

For 16 treatments in 10 replicates, the basic plan is used twice. Al- 
though the basic plan (plan 12.1) requires only 2 replicates, the design 
for 9 treatments is not recommended with less than 4 replicates. Even 
in this case there are only 8 d.f. for estimating row and column variances 
and only 8 d.f. for error. Little would be gained over randomized blocks 
unless the extra controls were highly effective. A design with 9 treat- 
ments and 8 replicates is found by repeating the design for 9 treatments 
and 4 replicates. 

Certain factorial experiments can be arranged in balanced lattice
squares; e.g., 3 × 3, 8 × 2, 4 × 4, 4 × 2 × 2, 2⁴, 5 × 5, 7 × 7, 8 × 8,
4³, 2⁶, 9 × 9, 3⁴. All main effects and interactions are confounded to the
same extent with rows and columns.

12.12 Partially Balanced Squares. For a balanced design, the available
numbers of replications are rather severely restricted. Although de-
signs are possible for other numbers of replicates, they lack the symmetry 
of the designs described above, with the consequence that the statistical 
analysis usually becomes more complicated. It happens, however, that 
if the number of replications is less than that required for a balanced de- 
sign, the statistical analysis follows the same procedure as for balanced 
designs, apart from minor changes in some formulae. In experimenta- 
tion where double grouping has proved effective but where the replica- 
tions are not sufficient for a balanced lattice square, the additional de- 
signs shown in table 12.2 may be useful. 

TABLE 12.2  Other lattice square designs

Number of treatments    49    64     81     121       169
Number of replicates    3     3, 4   3, 4   3, 4, 5   3, 4, 5, 6

When the number of treatments is odd, the plans for these designs are
obtained simply by taking the desired number of replicates from the
plans for the balanced designs. Thus, for 49 treatments in threefold
replication, we use squares I to III from plan 12.4. With 64 treatments,
squares I, III, and V are used for 3 replicates, and squares I, III, V, and
VII for 4 replicates.

Partially balanced lattice squares are analogous to the triple, quad- 
ruple, and quintuple lattices. 

12.13 Arrangement of Experimental Material. The method of arranging
the experimental material is similar to that for an ordinary latin
square. With k² treatments, the units are arranged in k × k squares so
that the rows and columns of each square correspond to the two types of
variation whose effects we wish to eliminate from the errors. In field
experiments, the plots of each replicate are usually laid out in square
formation, in which case row and column differences represent fertility
variations in two directions at right angles to each other. If the width of a
greenhouse bench accommodates 5 pots, 25 treatments can be laid out in
replicates which consist of 5 rows of 5 pots each.

12.14 Randomization. Within each replicate the rows and columns 
of the basic plan should be permuted separately at random before apply- 
ing the treatments. It is also advisable to assign the treatments at ran- 
dom to the treatment numbers in the plan. 
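The randomization just described may be sketched as follows. This is a minimal illustration under stated assumptions, not a prescribed routine: the function names `random_labels` and `randomize_replicate` are hypothetical, and the 3 × 3 square used below is a made-up basic plan, not one of the plans of this chapter.

```python
import random


def random_labels(n_treatments, rng):
    """Assign the actual treatments at random to the plan numbers 1..n."""
    numbers = list(range(1, n_treatments + 1))
    shuffled = numbers[:]
    rng.shuffle(shuffled)
    return dict(zip(numbers, shuffled))


def randomize_replicate(plan, labels, rng):
    """Permute rows and columns of one replicate separately at random."""
    k = len(plan)
    row_order = rng.sample(range(k), k)   # random order of the rows
    col_order = rng.sample(range(k), k)   # independent random order of the columns
    return [[labels[plan[i][j]] for j in col_order] for i in row_order]


rng = random.Random(1942)                     # seeded only so the layout can be reproduced
basic = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]     # hypothetical basic square
labels = random_labels(9, rng)                # one labelling for the whole experiment
layout = randomize_replicate(basic, labels, rng)
for row in layout:
    print(row)
```

The same labelling is used for every replicate, while a fresh pair of row and column permutations is drawn for each replicate, so that the grouping of treatments into rows and columns is left unchanged.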


12.2 Statistical Analysis 

12.21 Designs with (k + 1)/2, or Fewer, Replications. The same
instructions apply to the partially balanced designs and to the balanced
designs with (k + 1)/2 replications. Table 12.3 shows the plan and
yields for a lattice square with 25 corn hybrids in 3 replications,
conducted by the North Carolina Experiment Station in 1942. Each plot
contained 4 rows, with 10 hills per row.

The computations proceed as follows. The number of treatments is
k², and the number of replications r.

1. Find the row, column, treatment, replication, and grand totals. 
The treatment totals are placed in a square under the plan. 

2. For each row, calculate the L value, where

L = total (from all replications) of all treatments included in the
row − r times row total

L = 89.8 + 93.3 + 82.1 + 88.9 + 91.1 − 3(159.1) = −32.1

As a check, the total of the L values over a replication equals

(Grand total) − r(replication total)

Thus,

−76.2 = 2189.4 − 3(755.2)

Similarly for each column we obtain M values.

M = total (from all replications) of all treatments included in the
column − r times column total

TABLE 12.3  Yields (pounds corn per plot) for a 5 × 5 lattice square in
3 replications

Treatment numbers are shown in parentheses

Square II

Square III

                                                     +28.0      +277.2
ε    +4.45    +3.19    +2.84    +2.42    +1.45      +14.35

Grand total    2189.4    0    0

0 




Treatment totals

(1)  78.7    (2)  88.9    (3)  78.9    (4)  90.2     (5)  102.3
(6)  82.0    (7)  88.1    (8)  103.5   (9)  93.3     (10) 95.1
(11) 82.1    (12) 91.6    (13) 78.0    (14) 83.2     (15) 94.3
(16) 94.0    (17) 90.2    (18) 89.8    (19) 102.0    (20) 72.8
(21) 77.2    (22) 88.5    (23) 74.5    (24) 77.3     (25) 91.1

                                                   Total  2189.4


Adjusted treatment totals 

(1) 83.7 (2) 85.7 (3) 87.9 (4) 88.9 (5) 95.3 

(6) 84.4 (7) 81.6 (8) 100.1 (9) 87.4 (10) 97.4 

(11) 90.1 (12) 90.1 (13) 78.5 (14) 86.2 (15) 98.7 

(16) 85.5 (17) 83.8 (18) 93.8 (19) 107.0 (20) 77.0 

(21) 78.1 (22) 94.4 (23) 72.6 (24) 79.7 (25) 81.5 



3. We now compute the analysis of variance. The total s.s. and the
sums of squares for replications and treatments are found in the usual
way. If the symbol L_r denotes a replication total of the L's, the sum of
squares for rows within replications, adjusted for treatments, is

$$\frac{\sum L^2}{kr(r-1)} - \frac{\sum L_r^2}{k^2 r(r-1)} = \frac{(32.1)^2 + (10.5)^2 + \cdots + (5.1)^2}{30} - \frac{(76.2)^2 + (201.0)^2 + (277.2)^2}{150}$$

$$= 1405.95 - 820.32 = 585.63$$

The sum of squares for columns, eliminating treatments, is

$$\frac{(16.0)^2 + (3.8)^2 + \cdots + (28.0)^2}{30} - \frac{(76.2)^2 + (201.0)^2 + (277.2)^2}{150} = 1058.53 - 820.32 = 238.21$$

The complete analysis of variance is shown in table 12.4.
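The arithmetic of this step may be checked as below. Since only the first and last L and M values are quoted in the text, the sketch takes the printed quotients 1405.95 and 1058.53 as given and computes only the correction term from the three replication totals of the L's; the function name `correction_term` is hypothetical.

```python
def correction_term(replication_totals, k, r):
    """Sum of squared replication totals of the L's, divided by k^2 r (r - 1)."""
    return sum(v * v for v in replication_totals) / (k * k * r * (r - 1))


k, r = 5, 3
correction = correction_term([76.2, 201.0, 277.2], k, r)

rows_adj = 1405.95 - correction      # rows within replications, adjusted for treatments
cols_adj = 1058.53 - correction      # columns, eliminating treatments

print(round(correction, 2), round(rows_adj, 2), round(cols_adj, 2))
```

The same correction term is subtracted for rows and for columns, because the L's and the M's have the same replication totals.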


TABLE 12.4  Analysis of variance for a 5 × 5 lattice square

                              d.f.                   s.s.       m.s.
Replications      (r − 1)                    2      546.88     273.44
Treatments        (k² − 1)                  24      611.09      25.46
Rows (adj.)       r(k − 1)                  12      585.63      48.80  E_r
Columns (adj.)    r(k − 1)                  12      238.21      19.85  E_c
Error             (k − 1)(rk − r − k − 1)   24      229.79       9.57  E_e
Total             (rk² − 1)                 74     2211.60



4. This step leads to the adjusted treatment means. Let
E_r = mean square for rows
E_c = mean square for columns

5. Multiply the L's by λ' to give δ values, and the M's by μ' to give
ε values, as presented in table 12.3.

6. The adjusted total for any treatment is secured by adding to the
unadjusted total the δ and ε values for each row and column in which
the treatment appears. For treatment (1), we have

78.7 + (−0.84) + (−2.13) + (−2.72) + (−0.65) + (8.51) + (2.84) = 83.7

To obtain the treatment means (not shown) we divide by the number 
of replicates. 

7. The average variance of the difference between two adjusted
treatment means is

$$\frac{2E_e}{r}\left[1 + \frac{rk}{k+1}(\lambda' + \mu')\right] = \frac{2(9.57)}{3}\left[1 + \frac{(3)(5)}{6}(0.1322)\right] = 8.49; \quad \text{s.e.} = 2.91$$

Except perhaps with the 5 × 5 and smaller squares, it is sufficiently
accurate to use this figure for comparisons between any pair of treatments.
More accurately, for two treatments that appear in the same row, the
variance of the difference is

$$\frac{2E_e}{r}\left[1 + (r-1)\lambda' + r\mu'\right] = 8.40; \quad \text{s.e.} = 2.90$$

For two treatments in the same column,

$$\frac{2E_e}{r}\left[1 + r\lambda' + (r-1)\mu'\right] = 8.58; \quad \text{s.e.} = 2.93$$

Clearly the average error is good enough in this experiment. For the
partially balanced designs one more formula is needed, since some pairs
of treatments do not occur together either in a row or in a column. The
variance of the difference is

$$\frac{2E_e}{r}\left[1 + r(\lambda' + \mu')\right]$$

8. As is typical of these designs, the analysis does not provide an exact
F-test. The treatments m.s. in the analysis (table 12.4) cannot be
compared with the error m.s., since the former has not been adjusted for row
and column effects. For an approximate F-test, compute the sum of
squares of deviations of the adjusted treatment totals in table 12.3. This
comes to 1606.05. Division by 3, since there are 3 replicates, gives
535.35, with a mean square of 22.31. This may be compared with the
effective error m.s., which is

$$E_e\left[1 + \frac{rk}{k+1}(\lambda' + \mu')\right] = 9.57\left[1 + \frac{15}{6}(0.1322)\right] = 12.73$$

The F-ratio is 22.31/12.73, or 1.75, with 24 and 24 d.f.
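The effective error, the average variance of a difference, and the approximate F-ratio can be verified together, since all three depend on the same bracketed factor. In this sketch the function name `effective_error` is hypothetical, and the sum λ' + μ' = 0.1322 is taken from the text rather than recomputed.

```python
import math


def effective_error(E_e, k, r, lam_plus_mu):
    """Effective error m.s.: E_e [1 + rk/(k + 1) (lambda' + mu')]."""
    return E_e * (1 + r * k / (k + 1) * lam_plus_mu)


k, r = 5, 3
eff = effective_error(9.57, k, r, 0.1322)

avg_variance = 2 * eff / r               # average variance of a difference of two means
std_error = math.sqrt(avg_variance)

adj_treatment_ms = 1606.05 / r / (k * k - 1)   # 535.35 on 24 d.f.
f_ratio = adj_treatment_ms / eff

print(round(eff, 2), round(avg_variance, 2), round(std_error, 2), round(f_ratio, 2))
```

The figures agree with 12.73, 8.49, 2.91, and 1.75 as given in the text.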

9. The gain in precision relative to randomized blocks is estimated as 
follows. If the experiment were analyzed as randomized blocks, rows 
and columns would be amalgamated with error, giving 



                  d.f.       s.s.        m.s.
Replications        2       546.88
Treatments         24       611.09
Error              48      1053.63      21.95


This error, 21.95, is compared with the effective error m.s., 12.73. The 
relative information is estimated as 21.95/12.73, or 172%. Thus 3 
replicates of the lattice square appear about as precise as 5 with ran- 
domized blocks. 

Often a comparison with a lattice design will be of more interest. This
experiment could have been planned as a triple lattice, and in that event
the rows would probably have been chosen as blocks. The intra-block
error would be derived from the pooled sum of squares for columns and
error in table 12.4. The mean square would be 13.00, with 36 d.f.
Consequently, for the triple lattice we have E_b = 48.80 (m.s. for rows),
E_e = 13.00. The effective error m.s. (section 10.28) is

$$E_e\left[1 + \frac{rk\mu}{k+1}\right] = E_e\left[1 + \frac{r(E_b - E_e)}{(k+1)(r-1)E_b}\right] = 13.00\left[1 + \frac{3(48.80 - 13.00)}{(6)(2)(48.80)}\right] = 15.38$$

The relative accuracy of the lattice square to the triple lattice is
estimated as 15.38/12.73, or 121%.
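Both comparisons of precision can be retraced from the sums of squares in table 12.4. The sketch below is illustrative; the function name `triple_lattice_effective_error` is hypothetical, and the effective error 12.73 of the lattice square is taken from the preceding step.

```python
def triple_lattice_effective_error(E_b, E_e, k, r):
    """E_e [1 + r (E_b - E_e) / ((k + 1)(r - 1) E_b)], as in the formula above."""
    return E_e * (1 + r * (E_b - E_e) / ((k + 1) * (r - 1) * E_b))


k, r = 5, 3
lattice_square_error = 12.73

# Randomized blocks: rows, columns, and error are pooled (12 + 12 + 24 d.f.).
rb_error = (585.63 + 238.21 + 229.79) / 48
rel_to_randomized_blocks = rb_error / lattice_square_error

# Triple lattice with rows as blocks: columns and error are pooled (12 + 24 d.f.).
intra_block = (238.21 + 229.79) / 36
tl_error = triple_lattice_effective_error(48.80, intra_block, k, r)
rel_to_triple_lattice = tl_error / lattice_square_error

print(round(rb_error, 2), round(rel_to_randomized_blocks, 2))
print(round(intra_block, 2), round(tl_error, 2), round(rel_to_triple_lattice, 2))
```

The ratios correspond to the 172% and 121% quoted in the text.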

An account of the theory and a worked example are given by Yates 
(12.1) for balanced squares and by Cochran (12.2) for partially balanced 
squares. 


12.22 Designs with (k + 1) Replications. In this case the analysis is
slightly different, since it is possible to apply a single adjustment for all
the rows in which a treatment lies, instead of making a separate
adjustment for every row. On the whole, however, the analysis is more
complex than with (k + 1)/2 replications.

TABLE 12.5  Percentages of squares attacked by boll weevils for a
4 × 4 lattice square in (k + 1) replications

                     Boll weevil infestation                          Row
                                                                     totals
Square I
                (10)  9.0    (12) 20.3    (9)  17.7    (11) 26.3      73.3
                (2)   4.7    (4)   9.0    (1)   7.3    (3)   8.3      29.3
                (14)  9.0    (16)  6.7    (13) 11.7    (15)  4.3      31.7
                (6)   4.0    (8)   5.0    (5)   5.7    (7)  14.3      29.0
Column totals        26.7         41.0         42.4         53.2     163.3

Square II
                (5)  19.0    (12)  8.7    (15) 13.0    (2)  15.7      56.4
                (10) 12.0    (7)   6.0    (4)  15.3    (13) 12.0      45.3
                (16) 12.7    (1)   6.3    (6)   1.7    (11) 13.0      33.7
                (3)   3.7    (14)  3.7    (9)   8.0    (8)  13.3      28.7
Column totals        47.4         24.7         38.0         54.0     164.1

Square III
                (10) 17.0    (15)  7.0    (8)  10.3    (1)   1.3      35.6
                (9)  11.3    (16) 12.3    (7)   3.0    (2)   5.3      31.9
                (12) 12.3    (13)  8.7    (6)   8.0    (3)   9.3      38.3
                (11) 30.3    (14) 22.3    (5)  11.0    (4)  12.7      76.3
Column totals        70.9         50.3         32.3         28.6     182.1

Square IV
                (16)  5.0    (12) 10.3    (8)   5.7    (4)  12.7      33.7
                (11)  2.7    (15)  6.7    (3)  10.3    (7)   5.7      25.4
                (1)   1.0    (5)  10.3    (9)  11.3    (13) 11.7      34.3
                (6)  11.0    (2)  19.0    (14) 20.7    (10) 29.7      80.4
Column totals        19.7         46.3         48.0         59.8     173.8

Square V
                (3)   2.0    (16)  5.0    (5)   4.0    (10) 13.7      24.7
                (6)   9.3    (9)   1.7    (4)   6.3    (15) 12.3      29.6
                (12) 16.7    (7)   4.3    (14) 18.7    (1)   8.7      48.4
                (13) 16.7    (2)  30.0    (11) 25.7    (8)  14.0      86.4
Column totals        44.7         41.0         54.7         48.7     189.1

Grand total                                                          872.4

The example is a 4 X 4 lattice square conducted by the U. S. Bureau 
of Entomology and Plant Quarantine at Florence, S. C., and described 
by Wadley (12.3). The treatments were 16 arsenical insecticides applied 
to cotton with a hand dusting machine. Plots were 10 rows wide and 
70 feet long, being about 1/18 acre. To allow for border effects, records
were taken only from the 4 center rows. The data in table 12.5 are the 
percentages of squares * showing attack by boll weevils. These figures 
were obtained by examining 100 squares per plot, 25 from each of the 
4 center rows. Such counts were made at intervals during the summer: 
the data are averages from 3 counts made in August. 

The simplest computational routine, devised by Yates (12.1), will be 
presented here, though it is not too well adapted for making clear the 
meaning of the various steps. Yates’s paper should be consulted for an 
account of the theory. 


TABLE 12.6  Original and adjusted treatment totals

        Treatment                                                                    Adj.
         totals,                                                                   treatment    Adj.
           T       R_t      C_t       D       L'       J'       K'       M'         totals     means
  1       24.6    181.3    164.1     17.2     64.3     81.5    133.1    150.3       32.24       6.45
  2       74.7    284.4    196.6     87.8   −250.8   −163.0    100.4    188.2       68.41      13.68
  3       33.6    146.4    221.9    −75.5    274.8    199.3    −27.2   −102.7       43.64       8.73
  4       56.0    214.2    222.1     −7.9     25.4     17.5     −6.2    −14.1       56.79      11.36
  5       50.0    220.7    223.1     −2.4    −31.1    −33.5    −40.7    −43.1       47.20       9.44
  6       34.0    211.0    161.4     49.6    −46.6      3.0    151.8    201.4       37.89       7.58
  7       33.3    180.0    211.0    −31.0    105.6     74.6    −18.4    −49.4       36.85       7.37
  8       48.3    213.4    224.0    −10.6     −1.4    −12.0    −43.8    −54.4       46.58       9.32
  9       50.0    197.8    240.3    −42.5     83.4     40.9    −86.6   −129.1       50.07      10.01
 10       81.4    259.3    253.5      5.8    −98.5    −92.7    −75.3    −69.5       74.57      14.91
 11       98.0    295.1    252.5     42.6   −211.1   −168.5    −40.7      1.9       87.95      17.59
 12       68.3    250.1    227.6     22.5   −104.9    −82.4    −14.9      7.6       63.51      12.70
 13       60.8    236.0    251.2    −15.2    −64.4    −79.6   −125.2   −140.4       53.45      10.69
 14       74.4    265.5    204.4     61.1   −157.5    −96.4     86.9    148.0       71.36      14.27
 15       43.3    178.7    236.5    −57.8    152.1     94.3    −79.1   −136.9       46.42       9.28
 16       41.7    155.7    199.4    −43.7    260.7    217.0     85.9     42.2       55.46      11.09
Totals   872.4   3489.6   3489.6      0        0        0        0        0        872.39     174.47


1. Find the row, column, replication, and grand totals, all shown in
table 12.5.

2. Find the treatment totals T (table 12.6). For each treatment find

R_t = total of all rows in which the treatment appears
C_t = total of all columns in which the treatment appears

It may save time to find all 3 quantities, T, R_t, and C_t, by simultaneous
addition, since this means that the positions of a treatment in the
different replicates are found only once. This can be done if the machine
carriage is moved so as to accommodate 3 sets of running totals. The
sum of the R_t values and the sum of the C_t values should each equal k
times the sum of the T values.

Thereafter the successive columns in table 12.6 are filled out as follows.

D = R_t − C_t
L' = kT − (k + 1)R_t + G
J' = D + L'
K' = J' + (k − 1)D
M' = D + K'

Note that the total of each of these quantities is zero.

* A "square" is the name given to the young flower bud.

3. Compute the analysis of variance. The only sums of squares
requiring special instructions are those for rows and columns. After
adjustment for treatment effects, it happens unfortunately that the sums
of squares for rows and columns are not mutually orthogonal. By a
well-known result in the analysis of variance, their combined sum of
squares may be expressed in either of two ways:

Rows (adj. for treatments) + columns (adj. for treatments and rows)
= columns (adj. for treatments) + rows (adj. for treatments and
columns)

Both expressions are computed.

Sum of squares for rows adjusted for treatments

$$\frac{\sum (L')^2}{k^3(k+1)} = \frac{(64.3)^2 + (250.8)^2 + \cdots + (260.7)^2}{320} = 1093.02$$

Sum of squares for rows adjusted for treatments and columns

$$\frac{\sum (J')^2}{k^3(k-1)} = \frac{(81.5)^2 + (163.0)^2 + \cdots + (217.0)^2}{192} = 1026.76$$

Sum of squares for columns adjusted for treatments

$$\frac{\sum (M')^2}{k^3(k+1)} = 625.85$$

Sum of squares for columns adjusted for treatments and rows

$$\frac{\sum (K')^2}{k^3(k-1)} = 559.59$$

Note that 1093.02 + 559.59 = 1026.76 + 625.85.
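Steps 1 to 3 can be retraced by machine from the yields of table 12.5. The sketch below is an illustration only: the function names `yates_columns` and `sum_sq` are hypothetical, and the cells are transcribed with the treatment labels and values that reproduce the printed row, column, and treatment totals. The quantities D, L', J', K', and M' follow the relations listed in step 2, taking K' = J' + (k − 1)D.

```python
K = 4  # each square is K x K; there are K + 1 replicates

# Table 12.5: one list of rows per square, each cell is (treatment, percentage).
SQUARES = [
    [[(10, 9.0), (12, 20.3), (9, 17.7), (11, 26.3)],
     [(2, 4.7), (4, 9.0), (1, 7.3), (3, 8.3)],
     [(14, 9.0), (16, 6.7), (13, 11.7), (15, 4.3)],
     [(6, 4.0), (8, 5.0), (5, 5.7), (7, 14.3)]],
    [[(5, 19.0), (12, 8.7), (15, 13.0), (2, 15.7)],
     [(10, 12.0), (7, 6.0), (4, 15.3), (13, 12.0)],
     [(16, 12.7), (1, 6.3), (6, 1.7), (11, 13.0)],
     [(3, 3.7), (14, 3.7), (9, 8.0), (8, 13.3)]],
    [[(10, 17.0), (15, 7.0), (8, 10.3), (1, 1.3)],
     [(9, 11.3), (16, 12.3), (7, 3.0), (2, 5.3)],
     [(12, 12.3), (13, 8.7), (6, 8.0), (3, 9.3)],
     [(11, 30.3), (14, 22.3), (5, 11.0), (4, 12.7)]],
    [[(16, 5.0), (12, 10.3), (8, 5.7), (4, 12.7)],
     [(11, 2.7), (15, 6.7), (3, 10.3), (7, 5.7)],
     [(1, 1.0), (5, 10.3), (9, 11.3), (13, 11.7)],
     [(6, 11.0), (2, 19.0), (14, 20.7), (10, 29.7)]],
    [[(3, 2.0), (16, 5.0), (5, 4.0), (10, 13.7)],
     [(6, 9.3), (9, 1.7), (4, 6.3), (15, 12.3)],
     [(12, 16.7), (7, 4.3), (14, 18.7), (1, 8.7)],
     [(13, 16.7), (2, 30.0), (11, 25.7), (8, 14.0)]],
]


def yates_columns(squares, k):
    """Return the grand total G and, per treatment, T, R, C, D, L, J, K, M."""
    T = dict.fromkeys(range(1, k * k + 1), 0.0)
    R = dict(T)
    C = dict(T)
    G = 0.0
    for square in squares:
        row_totals = [sum(y for _, y in row) for row in square]
        col_totals = [sum(row[j][1] for row in square) for j in range(k)]
        G += sum(row_totals)
        for i, row in enumerate(square):
            for j, (trt, y) in enumerate(row):
                T[trt] += y               # treatment total
                R[trt] += row_totals[i]   # total of rows holding the treatment
                C[trt] += col_totals[j]   # total of columns holding the treatment
    table = {}
    for trt in T:
        D = R[trt] - C[trt]
        L = k * T[trt] - (k + 1) * R[trt] + G
        J = D + L
        Kp = J + (k - 1) * D
        M = D + Kp
        table[trt] = {"T": T[trt], "R": R[trt], "C": C[trt],
                      "D": D, "L": L, "J": J, "K": Kp, "M": M}
    return G, table


def sum_sq(table, key, divisor):
    """Sum of squares of one column of the table, over the stated divisor."""
    return sum(row[key] ** 2 for row in table.values()) / divisor


G, TABLE = yates_columns(SQUARES, K)
ROWS_T = sum_sq(TABLE, "L", K ** 3 * (K + 1))    # rows adj. for treatments
ROWS_TC = sum_sq(TABLE, "J", K ** 3 * (K - 1))   # rows adj. for treatments and columns
COLS_T = sum_sq(TABLE, "M", K ** 3 * (K + 1))    # columns adj. for treatments
COLS_TR = sum_sq(TABLE, "K", K ** 3 * (K - 1))   # columns adj. for treatments and rows
print(round(ROWS_T, 2), round(COLS_TR, 2), round(ROWS_TC, 2), round(COLS_T, 2))
```

The two ways of combining the row and column sums of squares, `ROWS_T + COLS_TR` and `ROWS_TC + COLS_T`, should agree apart from rounding.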
TABLE 12.7  Analysis of variance

                                           d.f.              s.s.       m.s.
Replications                       k                  4      31.56       7.89
Treatments                         k² − 1            15    1244.20      82.95
Rows (adj. for treatments)         k² − 1            15    1093.02
Rows (adj. for tr. and col.)       k² − 1            15    1026.76      68.45  E_r
Columns (adj. for tr.)             k² − 1            15     625.85
Columns (adj. for tr. and rows)    k² − 1            15     559.59      37.31  E_c
Error                              (k² − 1)(k − 2)   30     680.17      22.67  E_e
Total                              k³ + k² − 1       79    3608.54

In finding the error s.s. by subtraction, note that we subtract the
combined effect of rows and columns only once.

4. This leads to the adjusted treatment totals. Compute

$$q = k^2 E_r E_c - E_e^2 = (16)(68.45)(37.31) - (22.67)^2 = 40{,}348$$

$$Q = (k - 1)q = 121{,}044$$

$$\lambda' = \frac{(E_r - E_e)(kE_c - E_e)}{Q} = \frac{(45.78)(126.57)}{121{,}044} = 0.04787$$

$$\mu' = \frac{(E_c - E_e)(kE_r - E_e)}{Q} = \frac{(14.64)(251.13)}{121{,}044} = 0.03037$$

The adjusted treatment totals are

T + λ'L' + μ'M'

and are inserted on the right of the M' column in table 12.6. For the
means, divide by r.

5. The error variance of the difference between two adjusted means is

$$\frac{2E_e}{r}\left[1 + k(\lambda' + \mu')\right] = \frac{2(22.67)}{5}\left[1 + 4(0.07824)\right] = 11.9$$

To obtain an approximate F-test of the treatments, compute the sum
of squares of deviations of the adjusted treatment totals, and divide by
(k + 1)(k² − 1) to give the mean square on a single unit basis. This is
tested against the effective error m.s., E_e[1 + k(λ' + μ')].

The gain in precision relative to a randomized blocks or lattice design
is estimated by the procedure for the lattice square with (k + 1)/2
replications.
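The adjustment factors of step 4 and the variance of step 5 follow directly from the three mean squares of table 12.7. A minimal sketch is given below; the function name `adjustment_factors` is hypothetical, and the adjusted total is shown for treatment (1), whose L' and M' values are 64.3 and 150.3.

```python
def adjustment_factors(E_r, E_c, E_e, k):
    """lambda' and mu' for a lattice square in (k + 1) replications."""
    q = k * k * E_r * E_c - E_e ** 2
    Q = (k - 1) * q
    lam = (E_r - E_e) * (k * E_c - E_e) / Q
    mu = (E_c - E_e) * (k * E_r - E_e) / Q
    return lam, mu


k = 4
r = k + 1
E_r, E_c, E_e = 68.45, 37.31, 22.67

lam, mu = adjustment_factors(E_r, E_c, E_e, k)

# Adjusted total for treatment (1): T + lambda' L' + mu' M'.
adjusted_total_1 = 24.6 + lam * 64.3 + mu * 150.3

# Error variance of the difference between two adjusted means.
variance_diff = 2 * E_e / r * (1 + k * (lam + mu))

print(round(lam, 5), round(mu, 5), round(adjusted_total_1, 2), round(variance_diff, 1))
```

The results may be compared with 0.04787, 0.03037, the adjusted total 32.24 in table 12.6, and the variance 11.9.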
12.23 Designs with 2(k + 1) Replications. This case is unlikely to
be of much practical interest except possibly with the 3 × 3 design in
8 replicates and the 4 × 4 in 10. The analysis is similar to that for (k + 1)
replications; the changes will be noted very briefly.

The partition of degrees of freedom is as follows.

                            d.f.
Replications                (2k + 1)
Treatments                  (k² − 1)
Rows
    Component (a)           (k² − 1)
    Component (b)           (k² − 1)
Columns
    Component (a)           (k² − 1)
    Component (b)           (k² − 1)
Error                       (2k − 3)(k² − 1)
Total                       2k²(k + 1) − 1

Corresponding to any row, there is another row with the same set of
treatments. Find the differences D_r between the totals of corresponding
rows. The sum of squares of deviations of the D_r from their replication
means, divided by 2k, gives component (a) of the rows s.s. Component
(b) corresponds to the component that was obtained with (k + 1)
replicates. As before, it has two forms, an L' and a J' form. Both are found
in the same way as with (k + 1) replications, except that the divisors
must be doubled. The sums of squares for columns are derived similarly.

The formulae for the adjustment factors are a little more complicated.
Let E_r be the mean square found by pooling component (a) and the J'
component of the rows s.s., with an analogous definition for E_c.
Calculate

$$w_i = \frac{1}{E_e}; \quad w_r = \frac{2k - 1}{2kE_r - E_e}; \quad w_c = \frac{2k - 1}{2kE_c - E_e}$$

$$\lambda' = \frac{w_i - w_r}{k\,[\,w_r + w_c + (k-1)w_i\,]}; \quad \mu' = \frac{w_i - w_c}{k\,[\,w_r + w_c + (k-1)w_i\,]}$$

The remainder of the analysis proceeds without change.


12.24 Missing Data. The procedure for analyzing lattice squares
when certain observations are missing has been worked out by Cornish
(12.4). As in the case of lattices (section 10.13), the method requires
two different estimates of each missing observation, one being that found
by minimizing the intra-row-and-column error, the other being that
appropriate to the case where the data are analyzed as a randomized block
experiment. In order to reduce the amount of computation, we present
a cruder method which should be satisfactory if only a small fraction of
the total observations is missing.

In this approach we estimate each missing observation by the first
method mentioned above, that is, by minimizing the intra-row-and-column
error s.s. Thereafter the analysis proceeds as usual: 1 d.f. is
deleted from the total and intra-row-and-column error s.s. for each
missing value. Special rules for t-tests are given later.

The following totals are used in the formulae for inserting estimates
in place of the missing values. R, C, T, and P are the totals of the row,
column, treatment, and replication, respectively, that contain the
missing observation, while G is the grand total. Further,

S_x = total of all other rows and columns in which the treatment with
the missing value appears (note that this sum does not include
the row and column which contain the missing value)

T_x = total (from all replicates) of all other treatments which appear
in the row or column that contains the missing value (this
total does not include the treatment that has the missing
value)

Since it is important that the definitions of S_x and T_x be correctly
understood, it is worth working through the numerical example given
below.

Experiment with (k + 1)/2 replications

$$x = \frac{k(k-1)(R + C) - (k+3)P + 2k(k-2)T + 6G - 2kS_x - 2kT_x}{(k-1)^2(k-3)}$$

Experiment with (k + 1) replications

$$x = \frac{k(k-1)(R + C) - (k+1)P + k(k-1)T + 3G - kS_x - kT_x}{(k-1)^2(k-2)}$$

Experiment with less than (k + 1)/2 replications. Because of the lack
of symmetry in this design, an additional symbol is needed,

U_x = total (from all replicates) of all treatments that appear in the
same row or column as the treatment that has the missing
value.

The distinction between U_x and T_x should be realized. T_x is a total
over only those other treatments that are in the actual row or column
that contains the missing value, while U_x is a total over all other
treatments that appear somewhere in the experiment in the same row or
column as the treatment that has the missing value. Thus T_x is a part
of U_x. It may help to note that T_x is a total over 2(k − 1) treatments,
or 2r(k − 1) observations, where r is the number of replications, while
U_x is a total over 2r(k − 1) treatments, or 2r²(k − 1) observations.

$$x = \frac{kr(r-1)(R+C) - r(r+1)P + k(k-2)(r-1)T + (r+1)G - krS_x - krT_x + \ldots}{(k-1)(r-1)(kr-k-r-1)}$$

Example. In the corn experiment in table 12.3, suppose that treat- 
ment (18) in replication I is missing. Here k = 5, r = (k + 1)/2 = 3. 
Omitting the observation 33.3 for this treatment, we find 

R = 125.8; C = 119.9; R + C = 245.7; P = 721.9 
T = 56.5; G = 2156.1 

S_x is the total of the rows and columns in which treatment (18) appears 
in squares II and III. Thus 

S_x = 165.1 + 145.3 + 109.8 + 126.0 = 546.2 

Finally, T_x is the total for treatments (2), (9), (11), and (25), which ap- 
pear in the row that has the missing value, plus that for treatments (5), 
(6), (12), and (24), which appear in the column with the missing value. 
These totals are to be taken from the table of treatment totals. This 
gives T_x = 709.2. Then 

$$x = \frac{(20)(245.7) - (8)(721.9) + (30)(56.5) + (6)(2156.1) - (10)(546.2) - (10)(709.2)}{32} = 38.0$$
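The arithmetic of this example can be checked with a few lines of Python. This is only a sketch: the variable names are ours, and the coefficients are written in terms of k on the assumption that they follow the formula for (k + 1)/2 replications, which for k = 5 gives the numbers 20, 8, 30, 6, 10, 10 and the divisor 32 used above.

```python
# Totals quoted in the worked example (k = 5, r = 3, treatment (18) missing).
k = 5
R_plus_C, P, T, G = 245.7, 721.9, 56.5, 2156.1
S_x, T_x = 546.2, 709.2

# Numerator and divisor for a design with (k + 1)/2 replications
# (assumed form; for k = 5 the coefficients are 20, 8, 30, 6, 10, 10).
numerator = (k * (k - 1) * R_plus_C - (k + 3) * P + 2 * k * (k - 2) * T
             + 6 * G - 2 * k * S_x - 2 * k * T_x)
divisor = (k - 1) ** 2 * (k - 3)

x = numerator / divisor
print(f"estimate of the missing value: {x:.1f}")
```

The estimate is then inserted in place of the missing value and the analysis proceeds as described, with 1 d.f. removed from the total and error lines.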

The exact formulae for t-tests are complicated, and it appears that any 
good approximate rule would also be rather complicated. The following 
rule is suggested, though it somewhat underestimates the standard 
errors for the smaller squares and also lacks the simplicity that might be 
desired. For a comparison between the means of two treatments A and 
B, we assign to each an effective amount of replication which depends on 
the number and situation of the missing values. When scoring A we 
examine each replicate in turn and assign a score by the following rules. 


Case                                                            Score to A 

I.   A missing                                                       0 

II.  A present, and B in the same row or column as A 
       i.   B missing                                                0 
       ii.  B present                                                1 

III. A present, and B not in the same row or column as A 
       i.   Other values missing in both row and column 
       ii.  Other values missing in row or column but not in both 
       iii. No values missing in row or column                       1 
The effective replication for A is of course its total score over all rep- 
lications in the experiment. B is scored similarly. Suppose that in the 
example the mean of treatment (18) were being compared with that of 
treatment (9), which occurs in the same row in square I. The effective 
replication for treatment (18) is 2, since case I arises in square I. The 
effective replication for treatment (9) is also 2, since case II (i) arises in 
square I. 
Plan 12.1   3 × 3 balanced lattice square 

t = 9, k = 3, r = 2, rows = 6, columns = 6, λ = 1 

Plan 12.2   4 × 4 balanced lattice square 

t = 16, k = 4, r = 5, rows = 20, columns = 20, λ = 2 

Plan 12.3   5 × 5 balanced lattice square 

t = 25, k = 5, r = 3, rows = 15, columns = 15, λ = 1 

Plan 12.4   7 × 7 balanced lattice square 

t = 49, k = 7, r = 4, rows = 28, columns = 28, λ = 1 

Plan 12.5   8 × 8 balanced lattice square 

t = 64, k = 8, r = 9, rows = 72, columns = 72, λ = 2 
17 50 3 36 29 62 15 48 
25 42 35 4 21 14 63 50 

46 2 17 35 16 53 60 31 

64 23 3 25 54 12 45 34 

55 40 29 4 58 41 11 22 

5 13 21 20 37 45 53 61 

33 58 27 20 5 54 47 16 

28 9 50 63 5 24 38 43 

6 14 22 30 38 46 54 62 

41 26 59 12 53 6 39 24 

37 51 15 42 20 6 32 57 

7 15 23 31 39 47 55 63 
Plan 12.6   9 × 9 balanced lattice square 

t = 81, k = 9, r = 5, rows 
Plan 12.7   11 × 11 balanced lattice square 

t = 121, k = 11, r = 6, rows = 66, columns = 66, λ = 1 


Square I 

   1    2    3    4    5    6    7    8    9   10   11 
  12   13   14   15   16   17   18   19   20   21   22 
  23   24   25   26   27   28   29   30   31   32   33 
  34   35   36   37   38   39   40   41   42   43   44 
  45   46   47   48   49   50   51   52   53   54   55 
  56   57   58   59   60   61   62   63   64   65   66 
  67   68   69   70   71   72   73   74   75   76   77 
  78   79   80   81   82   83   84   85   86   87   88 
  89   90   91   92   93   94   95   96   97   98   99 
 100  101  102  103  104  105  106  107  108  109  110 
 111  112  113  114  115  116  117  118  119  120  121 





Square II 

   1  102   82   62   42   22  112   92   72   52   32 
  33    2  103   83   63   43   12  113   93   73   53 
  54   23    3  104   84   64   44   13  114   94   74 
  75   55   24    4  105   85   65   34   14  115   95 
  96   76   45   25    5  106   86   66   35   15  116 
 117   97   77   46   26    6  107   87   56   36   16 
  17  118   98   67   47   27    7  108   88   57   37 
  38   18  119   99   68   48   28    8  109   78   58 
  59   39   19  120   89   69   49   29    9  110   79 
  80   60   40   20  121   90   70   50   30   10  100 
 101   81   61   41   21  111   91   71   51   31   11 

Square III 

   1  115  108   90   83   76   58   51   44   26   19 
  20    2  116  109   91   84   77   59   52   34   27 
  28   21    3  117  110   92   85   67   60   53   35 
  36   29   22    4  118  100   93   86   68   61   54 
  55   37   30   12    5  119  101   94   87   69   62 
  63   45   38   31   13    6  120  102   95   88   70 
  71   64   46   39   32   14    7  121  103   96   78 
  79   72   65   47   40   33   15    8  111  104   97 
  98   80   73   66   48   41   23   16    9  112  105 
 106   99   81   74   56   49   42   24   17   10  113 
 114  107   89   82   75   57   50   43   25   18   11 



Plan 12.8   13 × 13 balanced lattice square 

t = 169, k = 13, r = 7, rows = 91, columns = 91, λ = 1 
INCOMPLETE LATIN SQUARES (YOUDEN SQUARES) 

13.1 Description 

13.11 Youden Squares. These designs, which are constructed by a re- 
arrangement of certain of the balanced incomplete blocks, possess the 
characteristic “double control” of the latin square, without the restric- 
tion that the number of replicates must equal the number of treatments. 
A latin square with 13 treatments is rarely used, because it necessitates 
13 replications. There are, however, incomplete latin squares for 13 
treatments in either 4 or 9 replicates. Every treatment occurs once in a 
column (replication), and every pair of treatments appears together an 
equal number of times in the same block. Column differences are 
eliminated automatically from the treatment comparisons, while block 
differences may be removed by adjusting the treatment mean yields in 
the same way as with balanced incomplete blocks. Most of the designs 
were developed by Youden (13.1, 13.2), whose name is commonly as- 
sociated with them; previously Yates (13.3) had drawn attention to the 
group of designs in which the number of replicates is one less than the 
number of treatments. Table 13.1 summarizes the designs for numbers 
of treatments up to 37. 


TABLE 13.1   Summary of incomplete latin squares (r less than t) 

   t    r = k    λ     E    Type†        t    r = k    λ     E    Type† 

   4*     3      2    89     II         13      4      1    81     I 
   5*     4      3    94     II         13      9      6    95     I 
   6*     5      4    96     II         15      7      3    92     I 
   7      3      1    78     II         15      8      4    94     I 
   7      4      2    88     II         16      6      2    89     I 
   7*     6      5    97     II         16     10      6    96     I 
   8*     7      6    98     II         19      9      4    94     I 
   9*     8      7    98     II         19     10      5    95     I 
  10*     9      8    99     II         21      5      1    84     I 
  11      5      2    88     I          31      6      1    86     I 
  11      6      3    92     I          37      9      2    91     I 
  11*    10      9    99     II 

* Constructed from a t × t latin square by omission of the last column. 

† This refers to the method of analysis (see section 13.2). 
The number of replicates is always equal to the number of units per 
block. The symbol λ shows the number of times that two treatments 
appear together in a block, while E is the efficiency factor expressed in 
percentage. 

Plans are given (plans 13.1-13.15) for all designs in table 13.1 except 
those marked with an asterisk (*), which are constructed from latin 
squares by the omission of the last column. A 5 × 5 square, for in- 
stance, furnishes the plan for t = 5, r = 4. 
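The entries for λ and E can be cross-checked from t and k alone. The sketch below assumes the standard relations for a balanced incomplete block design with r = k, namely λ = k(k - 1)/(t - 1) and E = t(k - 1)/[k(t - 1)]; these relations are not stated in the text above, and the function name is ours.

```python
from fractions import Fraction

def youden_parameters(t, k):
    """Return (lambda, E in percent) for t treatments with r = k.

    Assumes the usual balanced-incomplete-block relations:
    lambda = k(k - 1)/(t - 1) and E = t(k - 1)/(k(t - 1)).
    """
    lam = Fraction(k * (k - 1), t - 1)
    efficiency = Fraction(t * (k - 1), k * (t - 1))
    return lam, float(100 * efficiency)

# A few of the designs listed in table 13.1.
for t, k in [(7, 3), (7, 4), (11, 5), (13, 4), (21, 5), (37, 9)]:
    lam, e = youden_parameters(t, k)
    print(f"t = {t:2d}, k = {k:2d}: lambda = {lam}, E = {e:.0f}")
```

A non-integral λ signals that no design with those values of t and k can exist.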

An interesting application of the design for the control of plant vari- 
ability was made by Youden (13.1) in greenhouse experiments on to- 
bacco-mosaic virus. The experimental unit was a single leaf, and the 
data consisted of the number of lesions produced per leaf by rubbing the 
leaf with a solution which contained the virus. The numbers of lesions 
had been found to depend much more on inherent qualities of the plant 
than on the position of the plant on the greenhouse bench. Conse- 
quently, each block of the design was a single plant, so that the large dif- 
ferences in responsiveness which existed among plants did not contribute 
to the experimental errors. The columns were the positions, from top 
to bottom, of the five leaves which were used on each plant. That is, 
the first replication contained the top leaf of every plant. Since there 
was a fairly consistent gradient in responsiveness down each plant, this 
control also proved effective. 

As in the example above, when laying out an incomplete latin square 
the general principle is to group the units so that differences among 
blocks and differences among columns represent the major sources of 
variation that are known or suspected. 

13.12 Randomization. The steps are 

1. Rearrange the blocks of the plan at random. 

2. Rearrange the replications of the plan at random. 

When a plan is repeated for additional replication, blocks and columns 
are randomized separately within each repetition. The repetitions may 
be kept separate. 
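The two randomization steps can be sketched in Python as follows. The function name and the small cyclic plan for t = 7, k = 3 are illustrative assumptions only; the plan is built from the residues {0, 1, 3} modulo 7 and is not claimed to be identical to plan 13.1.

```python
import random

def randomize_youden(plan, seed=None):
    """Randomize a plan given as a list of blocks.

    Each block lists its treatments in column (replication) order.
    """
    rng = random.Random(seed)
    block_order = list(range(len(plan)))
    rng.shuffle(block_order)              # step 1: blocks at random
    column_order = list(range(len(plan[0])))
    rng.shuffle(column_order)             # step 2: replications at random
    return [[plan[b][c] for c in column_order] for b in block_order]

# Illustrative plan: t = 7 treatments in 7 blocks of k = 3 units.
plan = [[(i + d) % 7 + 1 for d in (0, 1, 3)] for i in range(7)]
layout = randomize_youden(plan, seed=1)
for block in layout:
    print(block)
```

When the plan is repeated, the function would simply be called once per repetition, each call with its own random draw.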

13.2 Statistical Analysis 

In experiments with more than 10 treatments, the numbers of degrees 
of freedom for blocks are large enough to allow the use of inter-block in- 
formation. The appropriate analysis is described in the next section. 
For small experiments, where inter-block information should be ignored, 
the analysis is given in section 13.22. The recommended method of 
analysis for any experiment is indicated both in table 13.1 and in the 
plans by the "type" symbol (I or II). 
13.21 Type I. Analysis with Recovery of Inter-block Information. 
The steps are as follows (t = number of treatments, k = number of units 
per block = r). 

1. Calculate the treatment totals T, the column totals, the block 
totals, and the grand total G. 

2. For each treatment, calculate the total B_t of all the blocks which 
contain the treatment. Place these quantities in a column next to the 
column of treatment totals. Then form a third column of the quantities: 

W = (t - k)T - (t - 1)B_t + (k - 1)G 

The W's should sum to zero. 

3. The separation of degrees of freedom in the analysis of variance is 
as follows. 



                                        d.f.              m.s. 

Columns (replications)                (k - 1) 
Blocks (eliminating treatments)       (t - 1)             E_b 
Treatments                            (t - 1) 
Error                                 (t - 1)(k - 2)      E_e 

Total                                 (tk - 1) 

The total s.s. and the sums of squares for columns and treatments are 
found by the usual methods. The sum of squares for blocks (eliminating 
treatments) is the sum of squares of the W's, divided by kt(t - k)(k - 1). 
The error s.s. is obtained by subtraction. 

4. Calculate the factor 

$$\mu = \frac{E_b - E_e}{t(k-1)E_b}$$

The adjusted total for any treatment is 

Y = T + μW 

Should E_b be less than E_e, μ is taken as zero and no adjustments are 
made to the treatment totals. The adjusted treatment means are ob- 
tained on dividing each Y by r. The estimated error variance of the dif- 
ference between two adjusted treatment means is 

$$\frac{2E_e}{r}\,[1 + (t-k)\mu]$$
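Steps 1 to 4 can be put together in a short Python sketch. The function name, the cyclic plan for t = 7, k = 3 and the yields are all hypothetical and serve only to exercise the formulae; the weighting factor is written as μ throughout. A design this small would normally receive the type II analysis, so the example shows the arithmetic, not a recommended use.

```python
def type1_analysis(plan, y):
    """Sketch of the type I analysis for a Youden square.

    plan[b][c] is the treatment (0-based) in block b, column c;
    y[b][c] is the corresponding observation.
    """
    t, k = len(plan), len(plan[0])          # b = t blocks, r = k replications
    G = sum(sum(row) for row in y)
    block_tot = [sum(row) for row in y]
    col_tot = [sum(y[b][c] for b in range(t)) for c in range(k)]
    T = [0.0] * t                           # treatment totals
    Bt = [0.0] * t                          # totals of blocks containing each treatment
    for b in range(t):
        for c in range(k):
            T[plan[b][c]] += y[b][c]
            Bt[plan[b][c]] += block_tot[b]

    W = [(t - k) * T[j] - (t - 1) * Bt[j] + (k - 1) * G for j in range(t)]

    cf = G * G / (t * k)                    # correction for the mean
    total_ss = sum(v * v for row in y for v in row) - cf
    col_ss = sum(v * v for v in col_tot) / t - cf
    trt_ss = sum(v * v for v in T) / k - cf
    blk_ss = sum(w * w for w in W) / (k * t * (t - k) * (k - 1))
    err_ss = total_ss - col_ss - trt_ss - blk_ss

    Eb = blk_ss / (t - 1)
    Ee = err_ss / ((t - 1) * (k - 2))
    mu = (Eb - Ee) / (t * (k - 1) * Eb) if (Eb > Ee and Eb > 0) else 0.0
    Y = [T[j] + mu * W[j] for j in range(t)]
    return {"W": W, "Eb": Eb, "Ee": Ee, "mu": mu,
            "adjusted_means": [v / k for v in Y],
            "var_diff": 2 * Ee / k * (1 + (t - k) * mu)}

# Hypothetical plan and yields (not taken from the text).
plan = [[(i + d) % 7 for d in (0, 1, 3)] for i in range(7)]
y = [[12.1, 14.3, 13.0], [11.8, 15.2, 12.7], [13.5, 13.9, 14.1],
     [12.9, 14.8, 13.3], [11.2, 13.1, 12.5], [12.4, 15.0, 13.8],
     [13.0, 14.1, 12.2]]
result = type1_analysis(plan, y)
print(result["mu"], result["adjusted_means"])
```

Two checks follow from the formulae: the W's sum to zero, and the mean of the adjusted treatment means equals the mean of the whole experiment.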

13.22 Type II. Analysis without Recovery of Inter-block Information. 
The analysis proceeds as follows. 

1. Calculate the column totals, the block totals, the treatment totals 
T, and the grand total G. 

2. For each treatment, obtain the quantity 

Q = kT - B_t 

where B_t is the total of all the blocks in which the treatment appears. 
The quantities Q should sum to zero. 

3. The degrees of freedom in the analysis of variance are partitioned 
as in section 13.21. In this case the blocks s.s. is calculated without elim- 
inating treatment effects, while the treatments s.s. is adjusted for block 
differences. The total s.s. and the sums of squares for blocks are com- 
puted by the standard procedure. The treatments s.s. (adjusted) is the 
sum of the squares of the Q's, divided by tkλ. The error s.s. is found by 
subtraction. 

4. In order to obtain any treatment mean, adjusted for block dif- 
ferences, divide the corresponding Q by tλ and add to the quotient the 
mean for the whole experiment. 

The estimated error variance of the difference between two adjusted 
treatment means is 

$$\frac{2E_e(t-1)}{(k-1)t}$$

Numerical examples of this analysis for t = 7, r = 3, and t = 21, 
r = 5 are given in reference (13.1), and an example for t = 6, r = 5 in 
(13.3). 
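The type II computations are short enough to sketch in Python. The function name and the yields are hypothetical; the plan is the design for t = 4, r = 3 (λ = 2), built as a cyclic 4 × 4 latin square with the last column omitted.

```python
def type2_analysis(plan, y, lam):
    """Sketch of the type II analysis (no recovery of inter-block information).

    plan[b][c] is the treatment (0-based) in block b, column c;
    lam is the value of lambda for the design.
    """
    t, k = len(plan), len(plan[0])
    G = sum(sum(row) for row in y)
    block_tot = [sum(row) for row in y]
    T = [0.0] * t
    Bt = [0.0] * t
    for b in range(t):
        for c in range(k):
            T[plan[b][c]] += y[b][c]
            Bt[plan[b][c]] += block_tot[b]
    Q = [k * T[j] - Bt[j] for j in range(t)]              # step 2
    trt_ss_adj = sum(q * q for q in Q) / (t * k * lam)    # step 3
    grand_mean = G / (t * k)
    adj_means = [q / (t * lam) + grand_mean for q in Q]   # step 4
    return Q, trt_ss_adj, adj_means

# Cyclic 4 x 4 latin square with the last column dropped: t = 4, k = 3, lambda = 2.
plan = [[(b + c) % 4 for c in range(3)] for b in range(4)]
y = [[21.0, 24.5, 23.1],      # hypothetical yields
     [19.8, 22.0, 25.4],
     [22.3, 26.1, 20.7],
     [24.9, 20.2, 21.6]]
Q, trt_ss_adj, adj_means = type2_analysis(plan, y, lam=2)
print(Q, adj_means)
```

If the observations were exactly the sum of a block effect, a column effect and a treatment effect, the differences among the adjusted means would reproduce the differences among the treatment effects.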

13.23 Type Ia. Repetitions of Type I. Let n be the number of repli- 
cations in the basic plan, which is used p times in an experiment, so that 
the total number of replications r = np. 

The method of analysis presented here may be used with any of the 
designs in table 13.1, since even with small numbers of treatments the 
repetition provides sufficient degrees of freedom for estimating the inter- 
block variation. If, however, the efficiency factor exceeds 95%, it is 
scarcely worth while to utilize inter-block information. 

The changes necessary in the procedure of section 13.21 are outlined 
below, the numbers referring to the steps in that section. 

1. Unchanged. Corresponding to any block, there are (p — 1) other 
blocks which contain the same set of treatments. The block totals 
should be arranged in a table (table A, say) with t rows and p columns. 

2. Unchanged. Note that the calculation of the quantities B_t is ex- 
pedited by the use of the row totals of table A. 
3. The analysis of variance is as follows. 

                                 d.f.                   m.s. 

Columns (replications)         (kp - 1) 
Blocks 
    Component (a)              (p - 1)(t - 1) 
    Component (b)              (t - 1) 
    Block total                p(t - 1)                 E_b 
Treatments                     (t - 1) 
Error                          (t - 1)(pk - p - 1)      E_e 

Total                          (pkt - 1) 

In this analysis, the (p - 1) degrees of freedom among repetitions 
have been ascribed to the columns, so that the blocks s.s. is actually a 
sum of squares among blocks within repetitions. 

Component (a) of the sum of squares for blocks consists of comparisons 
among blocks which contain the same set of treatments and is the inter- 
action s.s. for table A. 

Component (b) is the sum of squares of the W's, with a divisor 

pkt(t - k)(k - 1). 

4. Unchanged except that 

$$\mu = \frac{p(E_b - E_e)}{pt(k-1)E_b + (t-k)(p-1)E_e}$$

The estimated error variance of the difference between two adjusted 
treatment means is 

$$\frac{2E_e}{r}\,[1 + (t-k)\mu]$$

13.24 Type IIa. Repetitions of Type II. As we have pointed out, 
in designs where the efficiency factor is high, inter-block information 
can be neglected. The changes required from section 13.22 should 
present no difficulty. The degrees of freedom subdivide as in section 
13.23. The p(t - 1) degrees of freedom among blocks within repetitions 
are calculated in the usual way, each repetition furnishing (t - 1) de- 
grees of freedom. Notice that the divisors for the treatments s.s. and 
for the quantities Q must be increased by the factor p. 


13.25 Missing Data. The formula for inserting an estimate in place 
of a missing observation is obtained by minimizing the intra-block error. 
This is the correct estimate for the smaller experiments where inter- 
block information is ignored. For reasons given in section 10.13, this 
estimate will also be used to provide an approximate solution when 
inter-block information is recovered. The formula is quite similar to that 
for balanced incomplete blocks, with the addition of an extra term in- 
volving the column (or replication) total. 

Let C, B, and T be the totals of the column, block, and treatment that 
contain the missing value, and let G be the grand total. Further, as in 
the statistical analysis, let B_t be the total of all blocks in which the 
treatment with the missing value appears (there will, of course, be r such 
blocks). Finally, 

T' = total (over all replicates) of all other treatments that appear in 
the block which has the missing value 

B_t' = total of the B_t values for all other treatments that appear in the 
block with the missing value 

The estimate x of the missing value is 

$$x = \frac{\lambda[rC + tB + (t-1)T - G] - rT' - (r-1)B_t + B_t'}{r(r-1)(r-2)}$$

The symbol λ is given for each design in table 13.1. Care must be 
taken not to confuse the block totals with the column totals. 

If the experiment contains p repetitions, the formula is 

$$x = \frac{p[pk\lambda C + pt\lambda B + k(k-1)T - p\lambda R - kT' - (k-1)B_t + B_t']}{r(k-1)(pk-p-1)}$$

where R is the total of the repetition in which the missing value occurs, 
and λ is as in table 13.1. 
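The single-repetition formula can be sketched in Python as follows. The function name, the cyclic plan for t = 7, r = 3 (λ = 1) and the data are hypothetical. The data are built as an exact sum of block, column and treatment effects, so the estimate should reproduce the deleted value.

```python
def estimate_missing(plan, y, lam):
    """Estimate one missing value in a Youden square (single repetition).

    plan[b][c] is the treatment (0-based) in block b, column c;
    y[b][c] is the observation, with None marking the missing cell.
    """
    t, r = len(plan), len(plan[0])              # k = r units per block
    missing = [(b, c) for b in range(t) for c in range(r) if y[b][c] is None]
    b0, c0 = missing[0]

    def val(b, c):
        return 0.0 if y[b][c] is None else y[b][c]

    block_tot = [sum(val(b, c) for c in range(r)) for b in range(t)]
    trt_tot = [0.0] * t
    Bt_all = [0.0] * t                          # B_t for every treatment
    for b in range(t):
        for c in range(r):
            trt_tot[plan[b][c]] += val(b, c)
            Bt_all[plan[b][c]] += block_tot[b]

    C = sum(val(b, c0) for b in range(t))       # column with the missing value
    B = block_tot[b0]                           # block with the missing value
    G = sum(block_tot)
    j0 = plan[b0][c0]                           # treatment with the missing value
    others = [plan[b0][c] for c in range(r) if c != c0]
    T, Bt = trt_tot[j0], Bt_all[j0]
    T_prime = sum(trt_tot[j] for j in others)
    Bt_prime = sum(Bt_all[j] for j in others)

    numerator = (lam * (r * C + t * B + (t - 1) * T - G)
                 - r * T_prime - (r - 1) * Bt + Bt_prime)
    return numerator / (r * (r - 1) * (r - 2))

# Hypothetical plan (t = 7, r = 3, lambda = 1) and exactly additive data.
plan = [[(i + d) % 7 for d in (0, 1, 3)] for i in range(7)]
block_eff = [0, 3, -2, 5, 1, -4, 2]
col_eff = [0, 6, -1]
trt_eff = [1, -3, 4, 0, 2, -5, 7]
y = [[20.0 + block_eff[b] + col_eff[c] + trt_eff[plan[b][c]] for c in range(3)]
     for b in range(7)]
true_value = y[2][1]
y[2][1] = None
x = estimate_missing(plan, y, lam=1)
print(x, true_value)
```

With real data the estimate will of course differ from the lost observation; the additive data are used here only because they give a check on the arithmetic.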

13.3 Other Designs for Small Numbers of Treatments 

13.31 Description. When the number of treatments is small, it is 
sometimes useful to have a design of the “latin square” type in which 
the number of replicates exceeds the number of treatments. Some de- 
signs can be obtained by repetition of an ordinary latin square or of a 
Youden square. Additional plans can be constructed by adding a You- 
den square to a latin square. A selection of these designs is shown in 
table 13.2 for numbers of treatments up to 7. 

It will be noted that plans are available for most numbers of replicates 
up to 10. Actually, plans can be made for any number of replicates up 
to 10; those shown here have been selected for ease of analysis. 
TABLE 13.2   Incomplete latin squares for small numbers of treatments 
(r > t) 

                      Ref. to                                Ref. to 
  t    k    r    E    plan      Type†      t    k    r    E    plan      Type† 

  3    5    5    96   13.16     III        4    9    9    99   13.22     IV 
  3    3    6   100   2 L.S.               4    5   10    96   13.20     IIa 
  3    7    7    98   13.17     IV         5    6    6    97   13.23     IV 
  3    8    8    98   13.18     III        5    4    8    94   *         Ia 
  3    3    9   100   3 L.S.               5    9    9    99   13.24     III 
  3    5   10    96   13.16     IIa        5    5   10   100   2 L.S. 
  3   10   10    99   13.19     IV         6    7    7    98   13.25     IV 
  4    5    5    96   13.20     IV         6    5   10    96   *         Ia 
  4    3    6    89   *         Ia         7    4    8    88   13.2      Ia 
  4    7    7    98   13.21     III        7    8    8    98   13.26     IV 
  4    4    8   100   2 L.S.               7    3    9    78   13.1      Ia 
  4    3    9    89   *         Ia 

* By repetition of the plan for r = t - 1, which is constructed by taking a t × t 
latin square and omitting the last column. 

† This refers to the method of analysis (see section 13.2). 


TABLE 13.3   Incomplete latin square for 4 treatments in 7 replicates 
Treatment symbols and yields of tomatoes (pounds) 

                               Column 
Block        I     II    III    IV     V     VI    VII    Total 

  1          2     2     4      4      3     3     1 
             50    72    83     82     76    89    74      526 

  2          1     3     3      1      4     2     4 
             40    59    71     91     59    73    52      445 

  3          4     1     2      3      1     4     2 
             43    57    58     98     54    71    51      432 

  4          3     4     1      2      2     1     3 
             48    54    74     97     75    75    54      477 

Totals       181   242   286    368    264   308   231    1880 


                                                       Adjusted means 
Treatment                T      Q = kT + B'    Q/48    (subtract 11.2) 

    1                   465       3,781        78.8         67.6 
    2                   476       3,777        78.7         67.5 
    3                   495       3,897        81.2         70.0 
    4                   444       3,585        74.7         63.5 

Check totals or mean   1,880     15,040                     67.1 
13.32 Type III. Statistical Analysis When k = it - 1. In this case 
the size of block, which equals the number of replicates, is 1 less than 
some multiple (i) of the number of treatments. Since these designs have 
not been discussed in the literature, an example is given from uniformity 
data. Table 13.3 shows the tomato yields in pounds of 28 single-row 
plots, Hartman and Stair's data (13.4), on which the design for 4 treat- 
ments in 7 replicates has been superimposed. 

Since the plots measured 6 feet X 24 feet, each column (replication) 
is compact, being 24 feet square. There are obviously substantial dif- 
ferences among replicates. The blocks, although very oblong, may also 
exhibit differences in yield, because each block is a separate row of plants. 

The efficiency factors of these designs are all very high, so that inter- 
block information is ignored in the analysis. 

The steps in the analysis are as follows: 

1. Calculate the column totals, the block totals, the treatment totals 
T, and the grand total G. 

2. A property of these designs is that each treatment is replicated less 
in one block than in the other blocks. Thus, in the example, treatment 
(1) appears only once in the first block, but twice in all other blocks. 
Similarly, treatment (2) is deficient in block 2 and so on. For each 
treatment calculate the quantity 

Q = kT + B' 

where B' is the total for the block in which the treatment is deficient. 
For instance 

Q_1 = 7 × 465 + 526 = 3781 

As a check, the quantities Q should sum to (k + 1)G. 

3. All sums of squares in the analysis of variance are obtained in the 
usual way except that for treatments, which is given by the sum of 
squares of deviations of the Q's, divided by k(k² - 1). In this case we 
have 

$$\frac{(3781)^2 + (3777)^2 + (3897)^2 + (3585)^2 - \tfrac{1}{4}(15{,}040)^2}{336} \approx 150$$

The analysis of variance is shown below. The degrees of freedom sub- 
divide as in section 13.21. 



                            d.f.      s.s.      m.s. 

Columns (replications)        6       5387      897.8 
Blocks                        3        750      250.0 
Treatments                    3        150       50.0 
Error                        15        766       51.1 

Total                        27       7053 

The elimination of block differences has substantially reduced the 
error m.s. 

4. To obtain the adjusted treatment mean yields, we first divide each 
Q by (k² - 1), in this case 48. From the resulting quantities we sub- 
tract G/tk(k - 1), or 1880/168 = 11.2. As a further check, the mean 
of the adjusted treatment means should equal the mean yield of the 
whole experiment (67.1). The efficiency factor is (k² - 1)/k², and the 
estimated error variance of the difference between two adjusted treat- 
ment means is 

$$\frac{2E_e r}{k^2 - 1}$$
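The computations of steps 1 to 4 for the data of table 13.3 can be reproduced with the following Python sketch. The variable and function names are ours; the treatment symbols and yields are those of the table.

```python
# Treatment symbols and yields from table 13.3 (rows = blocks, columns I-VII).
treatments = [
    [2, 2, 4, 4, 3, 3, 1],
    [1, 3, 3, 1, 4, 2, 4],
    [4, 1, 2, 3, 1, 4, 2],
    [3, 4, 1, 2, 2, 1, 3],
]
yields = [
    [50, 72, 83, 82, 76, 89, 74],
    [40, 59, 71, 91, 59, 73, 52],
    [43, 57, 58, 98, 54, 71, 51],
    [48, 54, 74, 97, 75, 75, 54],
]
t, k = 4, 7

block_tot = [sum(row) for row in yields]
G = sum(block_tot)
T = {j: 0 for j in range(1, t + 1)}
for trt_row, y_row in zip(treatments, yields):
    for j, v in zip(trt_row, y_row):
        T[j] += v

def deficient_block(j):
    """Index of the block in which treatment j occurs least often."""
    counts = [row.count(j) for row in treatments]
    return counts.index(min(counts))

# Step 2: Q = kT + B', with B' the total of the deficient block.
Q = {j: k * T[j] + block_tot[deficient_block(j)] for j in T}

# Step 3: treatments s.s. from the deviations of the Q's.
mean_Q = sum(Q.values()) / t
trt_ss = sum((q - mean_Q) ** 2 for q in Q.values()) / (k * (k * k - 1))

# Step 4: adjusted treatment means.
adjusted = {j: Q[j] / (k * k - 1) - G / (t * k * (k - 1)) for j in Q}

print(Q)
print({j: round(m, 1) for j, m in adjusted.items()})
print(round(trt_ss, 1))
```

The Q's and adjusted means agree with table 13.3. The treatments s.s. computed in this way comes to about 149, slightly less than the rounded value of 150 entered in the analysis of variance.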

13.33 Type IV. Statistical Analysis When k = it + 1. The analysis 
closely resembles that of section 13.32. For each treatment there is one 
block in which the treatment has extra replication. Thus, in plan 13.17 
for t = 3, r = 7, treatment (1) appears 3 times in the first block but only 
twice in any other block. The only changes in the computing instruc- 
tions of section 13.32 are (i) Q = kT - B', where B' is the total of the 
block in which the treatment has extra replication, (ii) the Q's sum to 
(k - 1)G, and (iii) for the adjusted treatment means, divide Q by 
(k² - 1), and add to the quotient the quantity G/tk(k + 1). The change 
in the divisor of G should be noted. 

The error variance of the difference between two adjusted means re- 
mains as in section 13.32. 


REFERENCES 

13.1 Youden, W. J. Use of incomplete block replications in estimating tobacco- 
mosaic virus. Contr. Boyce Thompson Inst. 9, 41-48, 1937. 

13.2 Youden, W. J. Experimental designs to increase accuracy of greenhouse 
studies. Contr. Boyce Thompson Inst. 11, 219-228, 1940. 

13.3 Yates, F. Incomplete latin squares. Jour. Agr. Sci. 26, 301-315, 1936. 

13.4 Hartman, J. D., and Stair, E. C. Field plot technique studies with tomatoes. 
Proc. Amer. Soc. Hort. Sci. 41, 315-320, 1942. 



PLANS 

Plan 13.1    t = 7, k = 3, r = 3, b = 7, λ = 1, E = 78, Type II 

Plan 13.2    t = 7, k = 4, r = 4, b = 7, λ = 2, E = 88, Type II 

Plan 13.4    t = 11, k = 6, r = 6, b = 11, λ = 3, E = 92, Type I 

Plan 13.5    t = 13, k = 4, r = 4, b = 13, λ = 1, E = 81, Type I 

Plan 13.6    t = 13, k = 9, r = 9, b = 13, λ = 6, E = 95, Type I 

Plan 13.7    t = 15, k = 7, r = 7, b = 15, λ = 3, E = 92, Type I 

Plan 13.8    t = 15, k = 8, r = 8, b = 15, λ = 4, E = 94, Type I 

Plan 13.9    t = 16, k = 6, r = 6, b = 16, λ = 2, E = 89, Type I 

Plan 13.10   t = 16, k = 10, r = 10, b = 16, λ = 6, E = 96, Type I 

Plan 13.11 

Plan 13.12   t = 19, k = 10, r = 10, b = 19, λ = 5, E = 95, Type I 

Plan 13.13   t = 21, k = 5, r = 5, b = 21, λ = 1, E = 84, Type I 

Plan 13.14   t = 31, k = 6, r = 6, b = 31, λ = 1, E = 86, Type I 

Plan 13.15   t = 37, k = 9, r = 9, b = 37, λ = 2, E = 91, Type I 
Plan 13.16   t = 3, k = 5, r = 5, b = 3, λ = 5, E = 96, Type III 

Plan 13.17 

Plan 13.18 

                         Reps. 
Block     I    II   III   IV    V    VI   VII  VIII 

 (1)      1    2    3     1     2    3    1    2 
 (2)      2    3    1     3     1    2    2    3 
 (3)      3    1    2     2     3    1    3    1 

Plan 13.19   t = 3, k = 10, r = 10, b = 3, λ = 10, E = 99, Type IV 

                              Reps. 
Block     I    II   III   IV    V    VI   VII  VIII   IX    X 

 (1)      1    2    3     1     2    3    1    2      3     1 
 (2)      2    3    1     3     1    2    2    3      1     3 
 (3)      3    1    2     2     3    1    3    1      2     2 

Plan 13.20   t = 4, k = 5, r = 5, b = 4, λ = 5, E = 96, Type IV 

                  Reps. 
Block     I    II   III   IV    V 

 (1)      1    2    3     4     1 
 (2)      2    3    4     1     4 
 (3)      3    4    1     2     3 
 (4)      4    1    2     3     2 
Plan 13.21   t = 4, k = 7, r = 7, b = 4, λ = 7, E = 98, Type III 

                       Reps. 
Block     I    II   III   IV    V    VI   VII 

 (1)      1    2    3     4     1    2    3 
 (2)      2    1    4     3     3    4    1 
 (3)      3    4    1     2     4    3    2 
 (4)      4    3    2     1     2    1    4 


Plan 13.22   t = 4, k = 9, r = 9, b = 4, λ = 9, E = 99, Type IV 

                            Reps. 
Block     I    II   III   IV    V    VI   VII  VIII   IX 

 (1)      1    2    3     4     1    2    3    4      1 
 (2)      2    1    4     3     3    4    1    2      4 
 (3)      3    4    1     2     4    3    2    1      2 
 (4)      4    3    2     1     2    1    4    3      3 


Plan 13.23   t = 5, k = 6, r = 6, b = 5, λ = 6, E = 97, Type IV 

                     Reps. 
Block     I    II   III   IV    V    VI 

 (1)      1    2    3     4     5    1 
 (2)      2    4    5     3     1    3 
 (3)      3    1    2     5     4    4 
 (4)      4    5    1     2     3    2 
 (5)      5    3    4     1     2    5 


Plan 13.24   t = 5, k = 9, r = 9, b = 5, λ = 9, E = 99, Type III 

                            Reps. 
Block     I    II   III   IV    V    VI   VII  VIII   IX 

 (1)      1    2    3     4     5    1    2    3      4 
 (2)      2    3    4     5     1    4    5    1      2 
 (3)      3    4    5     1     2    2    3    4      5 
 (4)      4    5    1     2     3    5    1    2      3 
 (5)      5    1    2     3     4    3    4    5      1 
Plan 13.25   t = 6, k = 7, r = 7, b = 6, λ = 7, E = 98, Type IV 

                       Reps. 
Block     I    II   III   IV    V    VI   VII 

 (1)      1    2    3     4     5    6    1 
 (2)      2    3    6     1     4    5    3 
 (3)      3    6    2     5     1    4    5 
 (4)      4    5    1     2     6    3    4 
 (5)      5    1    4     6     3    2    6 
 (6)      6    4    5     3     2    1    2 


Plan 13.26   t = 7, k = 8, r = 8, b = 7, λ = 8, E = 98, Type IV 

                         Reps. 
Block     I    II   III   IV    V    VI   VII  VIII 

 (1)      1    2    3     4     5    6    7    1 
 (2)      2    5    1     7     6    4    3    3 
 (3)      3    6    7     2     4    1    5    7 
 (4)      4    7    5     6     3    2    1    6 
 (5)      5    4    2     3     1    7    6    5 
 (6)      6    3    4     1     7    5    2    4 
 (7)      7    1    6     5     2    3    4    2 



CHAPTER 14 


ANALYSIS OF THE RESULTS OF A SERIES OF 
EXPERIMENTS 


14.1 Initial Steps in the Analysis 

14.11 Introduction. In a program of research it is quite common to 
repeat the same experiment at a number of different places, on a number 
of different occasions. There may be several reasons for this. Some- 
times the object of the research is to produce recommendations which 
are to apply to a population that is extensive either in space or in time 
or in both. Thus in agricultural field experimentation, many projects 
are undertaken in the hope that their results can be applied in practical 
farming. The conclusions drawn from such research, if they are to be of 
use, must be valid for at least several seasons in the future and over a 
reasonably large area of farm land. It has been found that the effective- 
ness of the common plant nutrients, of different varieties of a crop, and 
of different cultivation practices usually varies from field to field and, 
even more markedly, from season to season. A single experiment, how- 
ever well conducted, supplies information about only one place and one 
season. Consequently such experiments are carried out at several dif- 
ferent places in the area for which recommendations are wanted, and are 
repeated for a number of seasons. 

In other cases we may be interested, not in making inferences about 
some specific population, but in studying the influence of external con- 
ditions on some measurement or on the responses to treatments. For 
example, is the vitamin A content of a vegetable affected by the climate 
in which it is grown? Do the relative durabilities of two types of road 
surface depend on weather or topography or traffic load? Repetition of 
such experiments in different places is necessary in order to have vari- 
ations in the external factors that are under investigation. 

A third example is supplied by collaborative experiments on biological 
assay by a group of laboratories. Here the objects may be to obtain in- 
formation about the accuracy with which the potency of a drug can be 
estimated and to discover whether different laboratories reach the same 
conclusions about the relative potencies of different drugs. Bliss (14.1) 
gives an interesting account of a series of experiments of this type, where 
sixteen laboratories estimated the potency of preparations of digitalis by 
injection into cats. 

The appropriate statistical analysis for the data from a series of ex- 
periments will of course vary with the object of the research. Never- 
theless, the preliminary stages of the analysis tend to be the same in all 
cases. In this chapter an introductory account is given of these initial 
procedures, which present difficulties that are not always appreciated. 
The first step, which is to analyze and interpret the data from each in- 
dividual experiment, will be assumed to have been completed. 

When we begin to combine the data, the first point of interest is to 
examine whether the differences among treatments are the same in all 
experiments. This question is likely to be relevant whatever the purpose 
of the experiments. In experiments designed to lead to the recommenda- 
tion of a “best” treatment for some operation, we wish to know whether 
there is a consistent superiority of certain treatments, or whether on the 
other hand we may have to consider recommending different treatments 
for different circumstances. In the other two types of experimentation 
mentioned above, the test would indicate whether the responses to treat- 
ments have varied with the external conditions of the experiment, or 
whether the different laboratories agreed in their estimates of relative 
potency. 

Secondly, it is often, though not always, desirable to estimate and 
compare the average effects of treatments over the whole series of ex- 
periments. 

14.12 Numerical Example. The data come from a group of 6 experiments
on Irish potatoes conducted in two counties of North Carolina in
1945 and 1946. The results have been described by Nelson and Hawkins
(14.2), and form part of a study of the responses to applications of
superphosphate on soils of varying degree of fertility. In 1945, on each of
the 6 sites, the amount of readily soluble phosphorus in the soil itself was
estimated from soil samples by the modified Truog method. These
amounts varied from 48 to 850 lb. P₂O₅ per acre. The treatments
comprised 5 different levels of application, 0, 40, 80, 120, and 160 lb. P₂O₅
per acre.

These experiments are an example of the second type discussed above, 
in that their object was to find out the extent to which the responses to 
treatments were influenced by the condition of the soil. The soils were 
not intended to be a representative sample of the soils in the counties, 
but were chosen so as to give a wide range in fertility.

Potatoes were grown on the plots in 1945 and 1946, the treatments
being applied in both years, with in addition substantial applications of
nitrogen and potash on all plots so that these nutrients would not be
deficient. The data to be analyzed are the 1946 results. The mean yields
and error m.s. per plot are shown in table 14.1.

TABLE 14.1 Treatment yields (100 lb. per acre) of Irish potatoes, 1946

                              Experiment number
Pounds P₂O₅        1       2       3       4       5       6
applied
per acre         Amount of readily soluble phosphorus
                 (pounds P₂O₅) in soil, 1945
                  48     310     410     710     790     850

    0            114     142     180     170     130     170
   40            206     176     188     171     140     178
   80            231     201     207     188     150     187
  120            237     205     208     185     152     188
  160            252     217     217     189     156     189

  $s^2$*         106      83     158      43     204      52
  d.f.            21      12      12      12      12      12
  r†               4       5       5       5       5       5

* Error m.s. per plot.
† Number of replications.

In five of the six experiments the F-ratios for treatments against error
were significant. One feature of the results in table 14.1 is that
experiment 1, which has the lowest amount of available phosphorus in the
soil according to the soil tests, actually gave the highest total yield in
1946. It should be remembered that the soil tests were made in 1945, and
that some residual effects of the 1945 applications may have persisted.
Whatever the reason, this soil was apparently very responsive to
phosphorus in 1946; the plots without phosphorus do have the lowest yield
of all places.

It is possible to compute from table 14.1 a combined analysis of vari- 
ance for all six experiments. As will be seen later, there are limitations 
to the use of such an analysis. For the present we will ignore any com- 
plexities that may arise. Further, in order to discuss only the simplest 
case at first, we will omit experiment 1, which differs in size and structure 
from the other experiments. This omission is of course inappropriate 
from the agronomic point of view, but the data are being used to throw 
light on the general procedure in analysis rather than on the agronomic
questions involved.

14.13 Preliminary Combined Analysis When All Experiments Have the
Same Design. Formally, the analysis subdivides into the following
components.

Places 
Treatments 
Treatments X places 
Pooled experimental error 

In addition, in a complete analysis there are terms representing the dif- 
ferences among rows and columns in the individual experiments. Since 
these are not relevant to the interpretation, they are not included. 

The analysis can be computed from either treatment means or totals; 
the totals will be used here and are shown in table 14.2. 


TABLE 14.2 Treatment totals for experiments 2 to 6 (in 100 lb.)

Pounds                       Experiment
P₂O₅          2        3        4        5        6      Total

   0         710      900      850      650      850      3,960
  40         880      940      855      700      890      4,265
  80       1,005    1,035      940      750      935      4,665
 120       1,025    1,040      925      760      940      4,690
 160       1,085    1,085      945      780      945      4,840

Total      4,705    5,000    4,515    3,640    4,560     22,420


The first 3 components are computed in the same way as for a single 
randomized blocks experiment, except that all sums of squares are di- 
vided by an extra 5 in order to reduce them to a single-plot basis. Thus 
we have 


$$\text{Places:}\quad \frac{(4705)^2 + \cdots + (4560)^2}{25} - \frac{(22{,}420)^2}{125} = 41{,}367$$

$$\text{Treatments:}\quad \frac{(3960)^2 + \cdots + (4840)^2}{25} - \frac{(22{,}420)^2}{125} = 20{,}979$$


To obtain the treatments X places s.s. we calculate the total s.s. for 
table 14.2, which comes to 69,199. Then 

Treatments X places: 69,199 — 41,367 — 20,979 = 6853 

Since all experiments have the same number of error degrees of freedom,
the pooled error m.s. may be found as the simple average of the
five $s^2$ values for experiments 2 to 6 in table 14.1. This comes to 108,
as given in table 14.3.

The null hypothesis that the treatment differences are the same at all
places (i.e., that there are no treatment × place interactions) is tested
by the F-ratio 428/108, or 3.96, with 16 and 60 d.f., respectively. Since
the 5% level is 1.81, the ratio is definitely significant.

TABLE 14.3 Combined analysis of variance (on a single-plot basis)

Source of variation        d.f.       s.s.       m.s.
Places                       4      41,367
Treatments                   4      20,979      5,245
Treatments × places         16       6,853        428
Pooled error                60       6,480        108
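
The arithmetic leading to table 14.3 can be retraced with a short script. The Python sketch below is an illustration only, not part of the original computations; the variable names are our own, and it assumes the treatment totals of table 14.2 and the error mean squares of experiments 2 to 6 in table 14.1.

```python
# Treatment totals (in 100 lb.) from table 14.2.
# Rows: 0, 40, 80, 120, 160 lb. P2O5; columns: experiments 2 to 6.
totals = [
    [710, 900, 850, 650, 850],
    [880, 940, 855, 700, 890],
    [1005, 1035, 940, 750, 935],
    [1025, 1040, 925, 760, 940],
    [1085, 1085, 945, 780, 945],
]
error_ms = [83, 158, 43, 204, 52]  # s^2 of experiments 2 to 6 (table 14.1)
r = 5                              # replications in each experiment

t = len(totals)                    # number of treatments
p = len(totals[0])                 # number of places
grand = sum(sum(row) for row in totals)
cf = grand ** 2 / (r * p * t)      # correction term on a single-plot basis

place_totals = [sum(row[i] for row in totals) for i in range(p)]
treat_totals = [sum(row) for row in totals]

# Each squared total is divided by the number of plots it contains.
ss_places = sum(x ** 2 for x in place_totals) / (r * t) - cf
ss_treat = sum(x ** 2 for x in treat_totals) / (r * p) - cf
ss_total = sum(x ** 2 for row in totals for x in row) / r - cf
ss_inter = ss_total - ss_places - ss_treat

ms_inter = ss_inter / ((t - 1) * (p - 1))
# Equal error d.f. in every experiment, so a simple mean of the s^2 suffices.
pooled_error = sum(error_ms) / len(error_ms)
f_inter = ms_inter / pooled_error

print(round(ss_places), round(ss_treat), round(ss_inter))
print(round(ms_inter), pooled_error, round(f_inter, 2))
```

Because the script carries unrounded mean squares, the F-ratio it prints may differ in the last digit from the ratio of the rounded entries 428 and 108.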


A test of the average responses to treatments taken over all five places 
is of minor interest in this example, because the places do not constitute 
a random sample from a population about which we wish to make in- 
ferences. For purposes of illustration, however, we will assume that the 
places were selected as a random sample of fields on which potatoes might 
be grown commercially. 

In the F-test of the average effects of treatments there are two possible 
candidates for the denominator of F— the mean square for the treat- 
ments X places interactions (428) or the pooled error m.s. (108). Some- 
times this competition does not arise, because our knowledge of the data 
and the F-test of the interactions both indicate that there is no reason to 
suppose real interactions to be present. In that event the mean squares 
for interactions and error may be pooled to form a single denominator 
for the F-test of treatments. But frequently the experimental conditions 
are such that we expect interactions to be present. This is so in the ex- 
ample, where previous work in a number of countries has shown a rela- 
tion between the response to phosphorus and the amount found in the 
soil by a soil test. Consequently, even if the interactions m.s. had not 
proved significant, we might have been unwilling to assume that there 
were no interactions. 

In discussing the two F-ratios it is helpful to examine the mathematical
model on which the combined analysis is based. If $x_{ij}$ is the observed
mean for the $j$th treatment at the $i$th place, we postulate that

$$x_{ij} = \mu + \pi_i + \tau_j + \mu_{ij} + \bar{e}_{ij} \qquad (14.1)$$

where $\pi_i$, $\tau_j$ represent the effects of the place and the treatment,
respectively, $\mu_{ij}$ that of the treatment × place interaction, and
$\bar{e}_{ij}$ that of the experimental error. $\bar{e}_{ij}$ is the average of
the errors on the $r$ plots that receive the treatment at that place.

With this model we can study the nature of the mean squares in the
analysis of variance. If the experimental errors on individual plots have
a variance $\sigma_e^2$, and if the interaction terms $\mu_{ij}$ may be
considered to have a variance $\sigma_\mu^2$, the average values of the mean
squares work out as follows.


TABLE 14.4 Expected values of mean squares

Source of variation           Expected value of mean square
Treatments                    $\sigma_e^2 + r\sigma_\mu^2 + rp\,\dfrac{\sum (\tau_j - \bar{\tau})^2}{t - 1}$
Treatments × places           $\sigma_e^2 + r\sigma_\mu^2$
Pooled experimental error     $\sigma_e^2$

The symbols $r$, $p$, and $t$ stand respectively for the numbers of
replications, places, and treatments.

It will be noted that the treatments m.s. is influenced by three
components: the experimental error variance, the variance of treatment ×
place interactions, and the variance among the true treatment means
$\tau_j$. The pooled experimental error takes account of only the first of
these components. Hence the treatments m.s. may be statistically
significant, as compared with the pooled error m.s., either because there
are real differences among the $\tau_j$'s or because treatment × place
interactions are present. This F-test is informative only when we are
indifferent as to whether a significant treatments m.s. was due to real
treatment differences or to interactions. This situation is rare in
practice. In most
cases, on the contrary, having established that interactions are present, 
or at least that it is not safe to assume them absent, we wish to know 
whether in addition there are consistent differences among the effects of 
treatments. As table 14.4 shows, the appropriate denominator of F for 
this test is the interactions m.s. This contains both the interaction and 
error components of variation in exactly the same way as they enter into 
the treatments m.s. The F-ratio, 5245/428, or 12.26, with 4 and 16 d.f.,
is significant at the 1% level.
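
The expectations of the mean squares under model (14.1) may be checked empirically. The Python sketch below simulates many series of experiments with all treatment effects equal and averages the three mean squares. It is an illustration under stated assumptions, not a method from the text: the settings r = p = t = 5, an error variance of 100, an interaction variance of 50, the spread of the place effects, and the number of simulated series are all arbitrary choices of ours, and each experiment is treated as completely randomized (no rows or columns), so that the pooled error carries pt(r - 1) d.f.

```python
import random


def simulate_mean_squares(r=5, p=5, t=5, var_e=100.0, var_mu=50.0,
                          n_sim=2000, seed=1):
    """Average treatments, interaction, and error mean squares (single-plot
    basis) over simulated series with no true treatment differences."""
    rng = random.Random(seed)
    sums = {"treatments": 0.0, "interaction": 0.0, "error": 0.0}
    for _ in range(n_sim):
        place = [rng.gauss(0, 20) for _ in range(p)]  # arbitrary place effects
        cell = []          # cell[i][j]: mean of treatment j at place i
        ss_error = 0.0
        for i in range(p):
            row = []
            for _j in range(t):
                mu_ij = rng.gauss(0, var_mu ** 0.5)   # interaction term
                plots = [place[i] + mu_ij + rng.gauss(0, var_e ** 0.5)
                         for _ in range(r)]
                m = sum(plots) / r
                ss_error += sum((y - m) ** 2 for y in plots)
                row.append(m)
            cell.append(row)
        grand = sum(map(sum, cell)) / (p * t)
        place_m = [sum(row) / t for row in cell]
        treat_m = [sum(cell[i][j] for i in range(p)) / p for j in range(t)]
        ss_t = r * p * sum((m - grand) ** 2 for m in treat_m)
        ss_tp = r * sum((cell[i][j] - place_m[i] - treat_m[j] + grand) ** 2
                        for i in range(p) for j in range(t))
        sums["treatments"] += ss_t / (t - 1)
        sums["interaction"] += ss_tp / ((p - 1) * (t - 1))
        sums["error"] += ss_error / (p * t * (r - 1))
    return {name: total / n_sim for name, total in sums.items()}


avg = simulate_mean_squares()
print(avg)
```

With these settings the average treatments and interactions mean squares should both lie near 100 + 5(50) = 350, and the average error mean square near 100. A ratio of treatments to pooled error would therefore tend to be large even though no treatment differences exist, whereas the ratio of treatments to interactions has numerator and denominator with the same expectation.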

The conclusions from this initial analysis are: (i) there are real dif- 
ferences in response that are consistent from place to place, and (ii) there 
are real variations in responsiveness from place to place. It need not be 
stressed that these statements do not constitute a competent summary of 
the results. They merely indicate what should be examined next. In 
the present case we would consider which treatments have proved con- 
sistently superior, and, if possible, why this is so. Similarly, we should 
investigate the nature of the interactions and try to find a rational ex- 
planation for them. 


14.2 Criticisms of the Preliminary Analysis 

14.21 Heterogeneity of the Interaction Variance. The previous analysis
is open to several criticisms. Although these criticisms are not valid
for all series of experiments, our experience is that it is well to be on one's
guard against them. They deal essentially with the assumptions on
which the combined analysis was based.

These assumptions have been presented in section 14.13. Specifically,
we postulate that the model (14.1) holds. Further, the experimental
errors $e_{ij}$ of individual observations are assumed to be normally and
independently distributed with the same variance $\sigma_e^2$. Finally, for
the F-test of the treatments m.s. against the treatments × places
interactions, we require also the assumptions that the interaction terms
$\mu_{ij}$ are normally and independently distributed, with zero population
means and the same variance $\sigma_\mu^2$, and are independent of the $e$'s.

The first criticism is that some components of the treatments X places 
s.s. may be much larger than others, or in mathematical terms that the 
“interaction” variance $\sigma_\mu^2$ is not constant. This will happen if the
effectiveness of some treatments varies greatly from place to place, while
that of others varies less or not at all. 

If the interaction m.s. is heterogeneous in this sense, the F-test of 
treatments against interactions is vitiated. The general effect is that the 
F value read from the tables is too low, i.e., that too many significant re- 
sults are obtained. Some idea of the extent of the bias can be obtained in 
certain extreme cases. In the numerical example, if one comparison 
among the treatment means has a much larger interaction variance than 
any other comparisons, the F-ratio is distributed approximately as an 
F value with 1 and 4 d.f., respectively, instead of 4 and 16 d.f. The 5% 
significance level would be 7.71 instead of 3.01. More generally, if there 
are p places, F is distributed approximately with 1 and (p — 1) d.f. 
This situation produces about the greatest distortion in F that is likely 
to arise, so that 7.71 could be regarded as an upper limit to the
significance level of F.

The exact distribution of F can be worked out, but is not available in 
a form adapted for practical use. Even so, in many instances, uncer- 
tainty about the correct significance level for F does not preclude us from 
drawing conclusions from the test. For, in the example above, if F 
turned out to be 1.38 we would be confident that it is not significant, since 
the 5% level is at least 3.01. Similarly, there seems little doubt that the 
F value actually obtained in the experiments, 12.26, is significant, since 
it lies well above 7.71. On the other hand, we would be uncertain about 
significance if F happened to lie between 3 and 7. 

A better method of coping with this difficulty is to divide the treat- 
ment s.s. into a set of orthogonal components that will supply all or most 
of the information of interest to us. The interactions s.s. is partitioned 
in the same way so as to isolate the interactions of each component with 
places. By Bartlett's test for homogeneity of variance, reference (14.3),
we can then test whether $\sigma_\mu^2$ is the same for all the components of
the interactions s.s. If we decide that $\sigma_\mu^2$ can be assumed
constant, the difficulty vanishes. If $\sigma_\mu^2$ is not constant, it is
valid to test any component of the treatments s.s. against its own
interaction with places.
Apart from the extra computations, the only drawback to this pro- 
cedure is that the degrees of freedom in the denominator of F are reduced. 

These remarks will be illustrated by the example. In these experi- 
ments the principal point of interest is to see whether the average re- 
sponse to a given amount of phosphorus decreases when the amount 
presumed to be in the soil increases. It may also be worth while to ex- 
amine whether the rate of decline in response with the higher levels of 
dressing changes with the nature of the soil. These questions can be 
considered conveniently by fitting at each place a parabolic regression 
of the yield on the amount of dressing. Although a parabolic regression 
would scarcely be regarded as the true form of the response curve, it ap- 
pears to fit the data well. 

Since the dressings increase by equal amounts, the orthogonal
polynomials given by Fisher and Yates (14.4) are suitable. The calculations
are shown below for experiment 3.

                                  Multipliers for
Amount         Total         Linear      Quadratic
of P₂O₅        yield          term         term

    0            900           -2            2
   40            940           -1           -1
   80          1,035            0           -2
  120          1,040            1           -1
  160          1,085            2            2

Sum of products               470          -80
Divisor for square             50           70

The linear term is proportional to the average increase in yield per
40 lb. of P₂O₅; the quadratic term may be interpreted as measuring the
rate of decline in response with increased dressings. The values found
for the two terms are as follows.


                         Experiment
          2        3        4        5        6      Total

L        895      470      260      320      240     2,185
Q       -325      -80      -70     -100     -110      -685


The contributions to the treatments s.s. are

$$\text{Linear:}\quad \frac{(2185)^2}{250} = 19{,}097; \qquad \text{quadratic:}\quad \frac{(685)^2}{350} = 1341$$

In the treatments × places s.s. we have

$$L \times \text{places:}\quad \frac{(895)^2 + \cdots + (240)^2}{50} - 19{,}097 = 5893$$

$$Q \times \text{places:}\quad \frac{(325)^2 + \cdots + (110)^2}{70} - 1341 = 645$$

In table 14.5 the three interaction mean squares are in the same order of 
size as the corresponding treatment mean squares. This result is 
typical, in that large effects tend to have large interactions. It is obvious 
on inspection that the components of the interaction m.s. cannot be 
regarded as homogeneous. Accordingly, we test the linear component
of treatments against the mean square 1473 instead of against the
complete treatments × places m.s. of 428 in table 14.3. This makes a
substantial difference to the F-ratio, though it remains significant. The
quadratic × places m.s. (161) is not significantly above the error m.s.
(108), but it seems prudent to use it as the denominator in the F-test of
the quadratic term in treatments. The remainder of the interactions
shows no sign of significance, and could be pooled with error for an
F-test of the deviations from regression.


TABLE 14.5 Subdivision of the analysis of variance

Source of variation                 d.f.       s.s.       m.s.
Treatments
  Linear                              1      19,097     19,097
  Quadratic                           1       1,341      1,341
  Deviations from regression          2         541        270
Treatments × places
  Linear × places                     4       5,893      1,473
  Quadratic × places                  4         645        161
  Deviations × places                 8         315         39
Pooled error                         60       6,480        108
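
The subdivision into linear and quadratic components can be reproduced directly from the treatment totals of experiments 2 to 6. The Python sketch below is illustrative only; the function name `partition` and the layout of the data are our own. It applies the five-point orthogonal polynomial multipliers to each experiment, then forms the contribution of each component to the treatments s.s. and to the treatments × places s.s.

```python
# Treatment totals (in 100 lb.) for experiments 2 to 6, in order of
# dressing: 0, 40, 80, 120, 160 lb. P2O5.
totals = {
    2: [710, 880, 1005, 1025, 1085],
    3: [900, 940, 1035, 1040, 1085],
    4: [850, 855, 940, 925, 945],
    5: [650, 700, 750, 760, 780],
    6: [850, 890, 935, 940, 945],
}
r = 5  # replications behind each total
multipliers = {
    "linear": [-2, -1, 0, 1, 2],
    "quadratic": [2, -1, -2, -1, 2],
}


def partition(mult):
    """Return per-place contrasts, the component s.s., and its
    interaction with places, all on a single-plot basis."""
    per_place = {e: sum(c * y for c, y in zip(mult, ys))
                 for e, ys in totals.items()}
    divisor = r * sum(c * c for c in mult)       # 50 linear, 70 quadratic
    ss_main = sum(per_place.values()) ** 2 / (len(totals) * divisor)
    ss_by_place = sum(v ** 2 for v in per_place.values()) / divisor - ss_main
    return per_place, ss_main, ss_by_place


results = {name: partition(mult) for name, mult in multipliers.items()}
for name, (per_place, ss_main, ss_by_place) in results.items():
    ms_by_place = ss_by_place / (len(totals) - 1)
    # Each component is tested against its own interaction with places.
    print(name, per_place, round(ss_main), round(ms_by_place),
          round(ss_main / ms_by_place, 2))
```

Each F-ratio printed in the last column has 1 and 4 d.f., since a single component is tested against its own interaction with the five places.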


14.22 Heterogeneity of the Experimental Error Variances. A second
criticism concerns the assumption that the experimental error variances
$\sigma_e^2$ are the same in all experiments. In general this assumption will hold
only if all experiments have been conducted in the same way, with the 
same amount of control over environmental conditions and with experi- 
mental material of the same variability. In experiments with crops or 
animals this degree of uniformity is seldom attainable, because the 
natural variability among pieces of land or among animals at one place 
differs from that at other places. Hence in this and many other types of 
cooperative experimentation we expect a priori that experimental error 
variances change from place to place.

When there is doubt about this point, Bartlett's test of homogeneity
of variances can be applied to the error m.s., $s^2$, in the experiments. In
the example (table 14.1) the $s^2$ values varied from 43 in experiment 4 to
204 in experiment 5, and the test of homogeneity shows that $\sigma_e^2$ cannot
be assumed constant.
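
As an illustration, Bartlett's statistic may be computed for the five experiments used in the preliminary analysis. The Python sketch below uses the usual form of the test, in which the statistic is referred to chi-square with one fewer degrees of freedom than the number of variances. The function name is our own, the restriction to experiments 2 to 6 is an assumption made for the example, and the 5% point of chi-square with 4 d.f. (9.488) is entered by hand from standard tables.

```python
import math


def bartlett(variances, dfs):
    """Bartlett's test of homogeneity of variances.

    Returns the corrected chi-square statistic and its degrees of freedom.
    """
    k = len(variances)
    n_total = sum(dfs)
    # Pooled variance: each s^2 weighted by its degrees of freedom.
    pooled = sum(n * s2 for n, s2 in zip(dfs, variances)) / n_total
    m = n_total * math.log(pooled) - sum(
        n * math.log(s2) for n, s2 in zip(dfs, variances))
    # Correction factor that brings the statistic closer to chi-square.
    c = 1 + (sum(1 / n for n in dfs) - 1 / n_total) / (3 * (k - 1))
    return m / c, k - 1


# Error mean squares and error d.f. of experiments 2 to 6 (table 14.1).
chi2, df = bartlett([83, 158, 43, 204, 52], [12] * 5)
CHI2_5PCT_4DF = 9.488  # tabular 5% point, chi-square with 4 d.f.
print(round(chi2, 2), df, chi2 > CHI2_5PCT_4DF)
```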

Variation in $\sigma_e^2$ invalidates the F-test of the interactions m.s.
against the pooled error m.s. Although the effect on the significance level
of F is not known exactly, it operates so that use of the tabular F produces
too many significant results. As in the previous section, the extent of
the distortion in F can be seen in extreme cases. If one experiment has
a much higher error variance than any of the others, F will be distributed
approximately as the tabular F with $(t - 1)$ and $n'$ degrees of freedom,
where $t$ is the number of treatments and $n'$ is the number of error degrees
of freedom in the experiment with the high error variance. In the
example, if one of the experiments with 12 error d.f. happened to be much
less accurate than the other experiments, F would have 4 and 12 d.f.
instead of 16 and 60 as used in the test in section 14.13. The 5% level
would be 3.26 instead of 1.81. Since this case appears to be the most
unfavorable that would occur, we may conclude that the true significance
level of F lies somewhere between these values.

If the observed F in our data falls outside these limits, a definite con- 
clusion can be reached from the test without further knowledge of the 
exact significance level of F. This happens in the example, where the 
observed F-ratio for the interactions m.s. is 3.96. However, even if the 
exact significance level were known, the test can be criticized because 
the F-ratio is no longer the most sensitive test criterion. Studies have 
shown that, with the amount of variation in $\sigma_e^2$ that appears typical of
agricultural experimentation, this loss of sensitivity might be equivalent 
to discarding 10 to 20% of the data. An approximate test that avoids 
some of the loss in sensitivity will be presented in section 14.4. 

Thus far we have been discussing how the F-test of the interactions is 
affected by heterogeneity in the error variances. The F-test of
treatments may now be considered. If the interactions are regarded as
negligible, so that the denominator of F is the pooled m.s. for interactions
and error, the F-test of treatments is invalidated in about the same way 
as that of interactions. On the other hand, when interactions are 
present, and especially when they are large, the F-test of the treatments 
m.s. against the interactions m.s. is much less disturbed. The reason 
may be clearer if we consider the test of the linear component of treat- 
ments in table 14.5. The suggested denominator for this test was the 
linear X places component, 1473. This is nearly 14 times the pooled 
error m.s. It appears, therefore, that heterogeneity in the error variances
can influence only a small part of the interaction m.s., so that its effect
is, as it were, greatly reduced. These remarks apply with much less force 
to the quadratic X places m.s., to which, taking the data at their face 
value, the pooled error contributes more than half. 

14.23 Summary. When all experiments are identical in structure, a 
combined analysis of variance can be computed relatively easily. Ap- 
plication of the ordinary tests of significance to this analysis is frequently 
open to question because of heterogeneity in the error and interaction 
variances. Despite this, it is advisable to draw preliminary conclusions 
as far as possible from this analysis, at least in the present state of our 
knowledge, because tests that are fully efficient and theoretically sound 
have not yet been discovered, and the approximate tests that have been 
devised to meet the criticisms are more laborious. 

When the interactions are sizable, there is usually little difficulty in 
interpreting the combined analysis. For, if the F-ratio for interactions 
is large, we may be confident that it is statistically significant in spite of 
some uncertainty about the exact significance level of F, and the crit- 
icism that the F-test is not fully sensitive carries less weight if F estab- 
lishes significance. Further, as we have seen, the F-test of the treat- 
ments m.s. against the interactions m.s. is little affected by inequality 
in the error variances when the interactions are large. The chief point 
to remember is that the interactions m.s. may itself be heterogeneous. 
Subdivision of the treatments and interactions m.s. according to the 
treatment comparisons that are of greatest importance is often 
useful. 

Another situation that presents little difficulty occurs when the aver- 
age differences among treatments are substantial, yet interactions are 
negligible. In this case it may be found that neither the F value for in- 
teractions nor that for treatments is close to the significance level. The 
cases that leave us in doubt are those where the F values are just slightly 
above the tabular significance levels. 

14.3 Experiments of Unequal Size 

14.31 Numerical Example. Provided that the same set of treatments 
appears in all experiments, a combined analysis can usually be con- 
structed even when the experiments differ in size and structure. Some 
issues arise, however, that are not encountered when all experiments are 
identical in design. The problems will be illustrated by the inclusion of 
the first experiment in the previous example. This was arranged in
randomized blocks with 4 replications, whereas all other experiments were in
5 × 5 latin squares.* The treatment totals and numbers of replications
are shown in table 14.6.


TABLE 14.6 Treatment totals (in 100 lb.)

Pounds                            Experiment                                 No. of
P₂O₅            1        2        3        4        5        6     Total     plots

   0           456      710      900      850      650      850     4,416      29
  40           824      880      940      855      700      890     5,089      29
  80           924    1,005    1,035      940      750      935     5,589      29
 120           948    1,025    1,040      925      760      940     5,638      29
 160         1,008    1,085    1,085      945      780      945     5,848      29

Total        4,160    4,705    5,000    4,515    3,640    4,560    26,580     145
No. of plots    20       25       25       25       25       25


A combined analysis can be obtained by following the standard pro- 
cedure for data based on unequal numbers. The square of any quantity 
is divided by the number of replications involved. The sums of squares 
are given below. 

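$$\text{Total:}\quad \frac{(456)^2 + \cdots + (1008)^2}{4} + \frac{(710)^2 + \cdots + (945)^2}{5} - \frac{(26{,}580)^2}{145} = 131{,}925$$

$$\text{Places:}\quad \frac{(4160)^2}{20} + \frac{(4705)^2 + \cdots + (4560)^2}{25} - \frac{(26{,}580)^2}{145} = 55{,}509$$

$$\text{Treatments:}\quad \frac{(4416)^2 + \cdots + (5848)^2}{29} - \frac{(26{,}580)^2}{145} = 45{,}613$$

$$\text{Treatments} \times \text{places:}\quad 131{,}925 - 55{,}509 - 45{,}613 = 30{,}803$$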
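
The rule that each squared quantity is divided by the number of plots it represents is easy to apply mechanically. The following Python sketch retraces the four sums of squares above from the treatment totals of the six experiments; it is an illustration only, and the data layout and variable names are our own.

```python
# Treatment totals (in 100 lb.) by experiment, in order of dressing:
# 0, 40, 80, 120, 160 lb. P2O5.
totals = {
    1: [456, 824, 924, 948, 1008],
    2: [710, 880, 1005, 1025, 1085],
    3: [900, 940, 1035, 1040, 1085],
    4: [850, 855, 940, 925, 945],
    5: [650, 700, 750, 760, 780],
    6: [850, 890, 935, 940, 945],
}
reps = {1: 4, 2: 5, 3: 5, 4: 5, 5: 5, 6: 5}  # replications per experiment
t = 5                                        # number of treatments

n_plots = sum(reps[e] * t for e in totals)            # 145 plots in all
grand = sum(sum(ys) for ys in totals.values())
cf = grand ** 2 / n_plots

# Every squared total is divided by the number of plots it contains.
ss_total = sum(sum(y * y for y in ys) / reps[e]
               for e, ys in totals.items()) - cf
ss_places = sum(sum(ys) ** 2 / (reps[e] * t)
                for e, ys in totals.items()) - cf
treat_totals = [sum(ys[j] for ys in totals.values()) for j in range(t)]
ss_treat = sum(x * x for x in treat_totals) / sum(reps.values()) - cf
ss_inter = ss_total - ss_places - ss_treat

p = len(totals)
ms_treat = ss_treat / (t - 1)
ms_inter = ss_inter / ((p - 1) * (t - 1))
print(round(ss_total), round(ss_places), round(ss_treat), round(ss_inter))
print(round(ms_treat), round(ms_inter))
```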


The pooled error m.s. requires a little thought. If all experiments
have the same true error variance $\sigma_e^2$, the best procedure is to weight
each error m.s., $s_i^2$, by the number of degrees of freedom $n_i$. But if the
experiments vary in accuracy, this weighted mean is a biased estimate
of the component of error variance that enters into the interaction and
treatments m.s. The correct component is $\sum r_i \sigma_i^2 / \sum r_i$
instead of $\sum n_i \sigma_i^2 / \sum n_i$, where $\sigma_i^2$ is the error
variance and $r_i$ the number of replications in the $i$th experiment.
Consequently, unless we are confident that $\sigma_e^2$ is constant, it is
best to weight the $s_i^2$ values by the numbers of replications. This gives

$$s^2 = \frac{4(106) + 5(83 + 158 + \cdots + 52)}{29} = 108$$

* Since experiment 1 contained additional treatments (not discussed here), it
provided 21 error d.f.

The F-test of the interactions m.s. against the error m.s. is carried out
in the same way as with experiments of equal size, and is subject to the
criticisms previously discussed.

TABLE 14.7 Preliminary analysis of variance for experiments of unequal size

Source of variation        d.f.       s.s.       m.s.
Places                       5      55,509     11,102
Treatments                   4      45,613     11,403
Treatments × places         20      30,803      1,540
Pooled error                81                    108

The F-test of the treatments m.s. against the interactions m.s.
encounters a new difficulty that is due to the unequal numbers of
replications and is present even if all the assumptions required for the
analysis of variance are satisfied. Under these assumptions, the
expectations of the principal mean squares in table 14.7 are as given in
table 14.8.


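TABLE 14.8 Expected values of mean squares (with unequal numbers of
replications)

Source of variation        Expected value of mean square
Treatments                 $\sigma_e^2 + \bar{r}_1\sigma_\mu^2 + \left(\sum r_i\right)\dfrac{\sum (\tau_j - \bar{\tau})^2}{t - 1}$
Treatments × places        $\sigma_e^2 + \bar{r}_2\sigma_\mu^2$
Pooled error               $\sigma_e^2$

where

$$\bar{r}_1 = \frac{\sum r_i^2}{\sum r_i} = \frac{141}{29} = 4.862$$

$$\bar{r}_2 = \frac{1}{p - 1}\left(\sum r_i - \bar{r}_1\right) = \frac{1}{5}(29 - 4.862) = 4.828$$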


The complicating factor is that the coefficient of $\sigma_\mu^2$ in the expected
treatments m.s. is not the same as that in the expected treatments X 
places m.s.; in fact, it is always larger. The difference is very small in 
this example, as it is whenever the experiments do not vary much in 
numbers of replications. In addition, and for the same reason, the in- 
teractions m.s. is not distributed as a multiple of chi-square; that is, it 
does not have the type of distribution required for the validity of the 
F-test. Both factors tend to make the F-test give too many significant
results. 

A crude method which at least prevents us from being led too far
astray is to adjust the F-ratio so as to remove the upward bias in
expectation. Let the expected values of the three mean squares be
denoted by $\theta_t$, $\theta_{tp}$, and $\theta_e$, respectively. Then, if
the null hypothesis is true (all $\tau_j$ equal), it is seen from table 14.8
that the three expected values are connected by the equation

$$\theta_t + (k - 1)\theta_e = k\theta_{tp}$$

where $k = \bar{r}_1/\bar{r}_2$. In other words, the test that we seek is a
test that this relation holds. This suggests that F might be computed as

$$F' = \frac{s_t^2 + (k - 1)s_e^2}{k s_{tp}^2} = \frac{11{,}403 + 0.007(108)}{(1.007)(1540)} = 7.35$$

as compared with the original F-ratio of 11,403/1540, or 7.40. The
significance level of $F'$ will also be altered slightly from that in the table.
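
The adjustment is simple to carry out numerically. The Python sketch below computes the pooled error m.s. weighted by numbers of replications, the two coefficients of the interaction variance (called `r1` and `r2` here, names of our own choosing), their ratio k, and the adjusted ratio. It is an illustration only, and takes the treatments and interactions mean squares as the rounded entries 11,403 and 1,540 of table 14.7.

```python
reps = [4, 5, 5, 5, 5, 5]                 # replications, experiments 1 to 6
error_ms = [106, 83, 158, 43, 204, 52]    # s^2 per plot (table 14.1)
p = len(reps)

# Pooled error m.s.: each s^2 weighted by its number of replications.
pooled_error = sum(r * s2 for r, s2 in zip(reps, error_ms)) / sum(reps)

# Coefficients of the interaction variance in the expected mean squares
# for treatments (r1) and for treatments x places (r2).
r1 = sum(r * r for r in reps) / sum(reps)
r2 = (sum(reps) - r1) / (p - 1)
k = r1 / r2

ms_treat, ms_inter = 11403, 1540          # from table 14.7
f_plain = ms_treat / ms_inter
f_adjusted = (ms_treat + (k - 1) * pooled_error) / (k * ms_inter)

print(round(pooled_error), round(r1, 3), round(r2, 3), round(k, 3))
print(round(f_plain, 2), round(f_adjusted, 2))
```

Since k exceeds unity only slightly when the numbers of replications are nearly equal, the adjusted ratio differs little from the unadjusted one in this example.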

When interactions are likely to be large, and the chief interest centers 
in the test of the treatments m.s. against the interactions m.s., an al- 
ternative approach has much to commend it. This is to compute the 
analysis from the unweighted treatment means given in table 14.1, paying 
no attention to the numbers of replications. With this type of analysis 
the bias disappears, the expectations of the mean squares being shown 
in table 14.9.

TABLE 14.9 Expected values of mean squares in an unweighted analysis

Source of variation        Expected value of mean square
Treatments                 $\dfrac{\sigma_e^2}{\bar{r}_h} + \sigma_\mu^2 + p\,\dfrac{\sum (\tau_j - \bar{\tau})^2}{t - 1}$
Treatments × places        $\dfrac{\sigma_e^2}{\bar{r}_h} + \sigma_\mu^2$

It will be noted that $\sigma_e^2$ and $\sigma_\mu^2$ carry the same coefficients
in both expectations. The divisor $\bar{r}_h$ is the harmonic mean of the
numbers of replications, given by $\bar{r}_h = p / \sum (1/r_i)$. Moreover, if
$\sigma_\mu^2$ is much larger than $\sigma_e^2/\bar{r}_h$, the interactions m.s.
tends to be distributed as a multiple of chi-square, so that the conditions
for the F-test are closely approximated.

It is suggested that the pooled error m.s. for insertion in an analysis
of this type be calculated from the formula

$$s^2 = \frac{1}{p}\left(\frac{s_1^2}{r_1} + \frac{s_2^2}{r_2} + \cdots + \frac{s_p^2}{r_p}\right)$$

By an extension of the results in table 14.9 to the case where the
experiments have different error variances, this quantity is found to be an
unbiased estimate of the error component that enters into the mean squares
for treatments and treatments × places. For the data in table 14.1 we
have

$$s^2 = \frac{1}{6}\left(\frac{106}{4} + \frac{83 + 158 + \cdots + 52}{5}\right) = 22.4$$

TABLE 14.10 Unweighted analysis of variance of treatment means

Source of variation        d.f.       s.s.       m.s.
Places                       5      11,692      2,338
Treatments                   4      10,554      2,638
Treatments × places         20       7,158        358
Pooled error                81                   22.4


Table 14.10, being in terms of treatment means, is in different units
from table 14.7. As a conversion factor, the harmonic mean $\bar{r}_h$,
which in this case is 4.8, may be used to multiply the mean squares above
for comparison with those in table 14.7, though this conversion is not
needed for our purpose.

In a choice between a weighted and an unweighted analysis, the fol- 
lowing are the relevant considerations. The statements below are 
strictly true only if all experiments have the same error variance per 
observation (or plot), though they should remain substantially true with 
a moderate variation in $\sigma_e^2$. The weighted analysis is superior for the
F-test of the interactions. It gives a more powerful test, and the F-ratio
approximates the tabular distribution of F more closely. It is also
superior for the F-test of treatments if interactions are non-existent or
small, since in this case the bias in the test is negligible. The unweighted
analysis is preferable for the F-test of treatments when the interactions
are not negligible. As it happens, the results in the example are not too
well in accord with these statements. For the interactions test, the
F-ratio is 14.26 with the weighted analysis and 15.98 with the unweighted
analysis. The explanation of the higher value with the unweighted 
analysis is probably that experiment 1, whose responses differed from 
those in the other experiments, had the smallest number of replications. 
The values of F in the test of treatments were practically identical, being 
7.35 (after adjustment) in the weighted analysis and 7.37 in the un- 
weighted analysis. 
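
The F-ratios quoted for the unweighted analysis can be checked directly from table 14.10. The short Python sketch below assumes that the interactions are tested against the pooled error m.s. and the treatments against the interactions m.s.; the variable names are illustrative only and do not come from the text.

```python
# Figures copied from table 14.10 (unweighted analysis of treatment means).
ss = {"treatments": 10554, "treatments_x_places": 7158}
df = {"treatments": 4, "treatments_x_places": 20}
pooled_error_ms = 22.4  # pooled error m.s. quoted in the text (81 d.f.)

# Mean squares are sums of squares divided by their degrees of freedom.
ms = {name: ss[name] / df[name] for name in ss}

# Interactions against pooled error; treatments against interactions.
f_interactions = ms["treatments_x_places"] / pooled_error_ms
f_treatments = ms["treatments"] / ms["treatments_x_places"]

print(round(f_interactions, 2), round(f_treatments, 2))
```

Both ratios agree with the unweighted values given in the paragraph above.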

14.32 The Combined Analysis for a Series of Lattice Experiments. In
the discussion of lattice experiments it was pointed out that if
inter-block information is recovered, the F-test of treatments in an
individual experiment is not exact, because the relative weights attached
to inter- and intra-block estimates are subject to sampling errors. For
the same reason any combined analysis must also be approximative rather
than exact.

The designs need not be identical at all places. With 25 treatments, 
for example, some experiments might be simple lattices in 4 replications, 
while others are lattice squares in 3 replications. Also, inter-block in- 
formation need not be recovered in all analyses; at some places we might 
have used a randomized blocks analysis. 

If the experiments have different numbers of replications, the pre- 
vious section has indicated that there is a question whether to use a 
weighted or an unweighted analysis. The former is preferable when the 
chief purpose is to test the interactions with places, and when such in- 
teractions are negligible; the latter when interactions are sizable and the 
principal interest is in a test of the average effects of treatments. With 
lattice experiments as used in agriculture, we are often interested in both 
interactions and average effects. But as interactions are seldom absent, 
the unweighted analysis is frequently advisable, and has the virtue of 
being slightly simpler. 

The weighted analysis will be described first. First form a two-way 
“treatments X places” table similar to table 14.6. The entries in the 
table are the adjusted treatment totals, except in experiments analyzed 
by randomized blocks, where unadjusted totals are used. This table is 
analyzed into components for 


Places 
Treatments 
Treatments X places 


If the numbers of replications $r_i$ differ, the weighted analysis follows
the same procedure as in table 14.6 (p. 402). The square of every marginal
treatment total is divided by $r_i$ in order to reduce the analysis of
variance to a single-observation basis. The square of a place total,
summed over all treatments, is divided by $tr_i$, where t is the number of
treatments.

The only additional component is the pooled error m.s. In any experiment
where inter-block information has been recovered, we use as the estimate
of the error variance per observation (or plot) the effective error m.s.
$E_i'$. This quantity is obtained by adjusting the intra-block m.s.
upwards so as to allow for sampling errors in the block corrections, and
its calculation is explained along with the analysis for each type of
lattice. For the triple lattice, for example, $E_i'$ is
$E_e\left[1 + \frac{rk\mu}{k+1}\right]$. If there are any experiments in
which the intra-block analysis has been carried out, $E_i'$ is given the
value which it takes when $E_b$ becomes very large relative to $E_e$. For
experiments analyzed by randomized blocks, $E_i'$ is the ordinary error
m.s.

On the assumption that the true effective variances are unlikely to be
the same in all experiments, the pooled error m.s. is calculated as
$\sum r_i E_i' / \sum r_i$. When $r_i$ is constant, this reduces to the
unweighted mean of the $E_i'$. Tests of significance are made as in the
numerical example, and are subject to the same general criticisms. Since
the larger lattice designs provide ample degrees of freedom for error, the
pooled error may be relatively well determined even if some experiments
are much more accurate than others, so that there is less uncertainty
about the true significance levels of the F-ratios.

For an unweighted analysis with experiments of unequal size, con- 
struct a two-way table of adjusted treatment means. This is analyzed 
by the simple standard procedure. The comparable pooled error m.s. is
taken as

$$\bar{s}^2 = \frac{1}{p}\left(\frac{E_1'}{r_1} + \frac{E_2'}{r_2} + \cdots + \frac{E_p'}{r_p}\right)$$

With balanced designs, the method given above is a natural extension 
of the technique used to obtain an approximate F-test of treatments in 
an individual experiment. With partially balanced designs the method 
is more crude than that used in single experiments, where a special sup- 
plementary calculation was made for the F-test. This calculation can be 
extended to apply to a combined analysis, but it is doubtful whether the 
elaboration is worth while. 


14.4 A Test of the Treatments × Places Interactions

In this section we give an approximate test of the treatments × places
interactions for series of experiments where there is considerable
variation in the experimental error variances. It was suggested that the
F-test, despite its imperfections, will often serve our purpose. The
present
test differs from the F-test in that it gives less weight to experiments 
which have a high error variance, and consequently may be expected to 
be more sensitive. It may be useful in cases where there is doubt about 
the verdict given by the F-test and in cases where the most efficient 
analysis is desired. 

As before, let $\bar{x}_{ij}$ be the mean of the jth treatment, and
$\sigma_i^2$ be the true error variance per observation in the ith
experiment. We assume that all experiments have the same design, and let
$s_i^2$ be the error m.s. per observation in the ith experiment, based on
n degrees of freedom.

For known values of $\sigma_i^2$ the theory of least squares indicates
that the mean $\bar{x}_{ij}$ should receive a weight
$w_i = r/\sigma_i^2$. In a weighted analysis of variance of the treatment
means, the interaction s.s. is known to be distributed as chi-square with
(p - 1)(t - 1) degrees of freedom. This sum of squares appears to be the
best test criterion available, unless we possess specific information
about the type of interaction that may exist, in which event a more
specialized criterion would be constructed.

In default of knowledge of the $\sigma_i^2$, the natural step is to
consider a weighted analysis with weights $w_i = r/s_i^2$. That is, we
weight each mean inversely as its estimated variance, since we do not know
its true variance. This is satisfactory, provided that the $s_i^2$ are
good estimates of the $\sigma_i^2$. Upon examination it appears that the
$s_i^2$ should be based on at least 15 d.f. To illustrate the
calculations, the test will be applied to the data from the last 5
experiments, though the number of degrees of freedom, 12, is slightly too
low to use the test with full confidence.


TABLE 14.11 Treatment mean yields (100 lb. per acre) for a weighted
analysis of variance

Pounds                        Experiment                        Weighted
P2O5              2         3         4         5         6       total
0               142       180       170       130       170      53.570
40              176       188       171       140       178      57.000
80              201       207       188       150       187      62.194
120             205       208       185       152       188      62.264
160             217       217       189       156       189      63.932

w_i           0.060     0.032     0.116     0.025     0.096       0.329 = W
Total (P_i)     941      1000       903       728       912
w_i P_i      56.460    32.000   104.748    18.200    87.552     298.960 = G
S.S. (S_i)  180,655   200,946   163,431   106,440   166,618

The arrangement of the data in table 14.11 should be noted. The
treatment means and $s_i^2$ values come from table 14.1 (p. 393). The
weights for individual entries in the table are placed in the row
immediately below the entries; for example, $w_2 = 5/83 \approx 0.060$.
The total of the weights is denoted by W.

1. Form the column totals $P_i$ and the products $w_iP_i$. Form the
weighted row totals. The corner total, G = 298.960, supplies a check
on both the column and row totals.

2. The items in the analysis of variance are obtained as follows.

Correction term:

$$C = \frac{G^2}{tW} = \frac{(298.960)^2}{(5)(0.329)} = 54{,}332.57$$

Total: First compute and record in the table the sum of squares $S_i$
(uncorrected) of the entries in each column. The total s.s. is then found
as

$$\sum (w_iS_i) - C = (0.060)(180{,}655) + \cdots + (0.096)(166{,}618) - C = 551.33$$

Places: A column (place) total has variance $t\sigma_i^2/r$, and hence
receives a weight $w_i/t$. The sum of squares is

$$\frac{1}{t}\sum (w_iP_i^2) - C = 230.09$$

This is found readily by using the products $w_iP_i$.

Treatments: Each treatment total has estimated weight 1/W. With $T_j$
denoting the weighted row totals, the sum of squares is

$$\frac{\sum T_j^2}{W} - C = \frac{(53.570)^2 + \cdots + (63.932)^2}{0.329} - C = 229.57$$

Treatments × places: Since this is the only component in which we are
currently interested, it is unfortunate that it must be found by means of
the others. As usual, it is given by

551.33 - 230.09 - 229.57 = 91.67 = I 
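
The arithmetic of steps 1 and 2 can be retraced with a few lines of Python. This is only a sketch: it takes the treatment means and the rounded weights exactly as printed in table 14.11, and the variable names are chosen here for illustration rather than taken from the text.

```python
# Treatment means from table 14.11: rows are P2O5 levels, columns experiments 2-6.
means = [
    [142, 180, 170, 130, 170],
    [176, 188, 171, 140, 178],
    [201, 207, 188, 150, 187],
    [205, 208, 185, 152, 188],
    [217, 217, 189, 156, 189],
]
w = [0.060, 0.032, 0.116, 0.025, 0.096]  # weights as rounded in the table

t = len(means)                                        # number of treatments
W = sum(w)                                            # total of the weights
columns = list(zip(*means))
P = [sum(col) for col in columns]                     # column (place) totals
S = [sum(x * x for x in col) for col in columns]      # uncorrected s.s. per column
T = [sum(wi * x for wi, x in zip(w, row)) for row in means]  # weighted row totals
G = sum(wi * Pi for wi, Pi in zip(w, P))              # corner total

C = G ** 2 / (t * W)                                  # correction term
total_ss = sum(wi * Si for wi, Si in zip(w, S)) - C
places_ss = sum(wi * Pi ** 2 for wi, Pi in zip(w, P)) / t - C
treatments_ss = sum(Tj ** 2 for Tj in T) / W - C
interaction_ss = total_ss - places_ss - treatments_ss

print(round(total_ss, 2), round(places_ss, 2),
      round(treatments_ss, 2), round(interaction_ss, 2))
```

Because the text subtracts values already rounded to two decimals, the interaction s.s. computed this way may differ from 91.67 by about 0.01.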


The sum of squares I does not follow a chi-square distribution, being
inflated by errors in the weights. It can be reduced to a quantity that is
distributed approximately as chi-square.* We take

$$\chi^2 = \frac{(n-4)(n-2)}{n(n+t-3)}\, I, \quad \text{with} \quad \frac{(p-1)(t-1)(n-4)}{(n+t-3)} \ \text{d.f.}$$

In this case

$$\chi^2 = \frac{(8)(10)}{(12)(14)}\,(91.67) = 43.65, \quad \text{with} \quad \frac{(16)(8)}{(14)} = 9.14 \ \text{d.f.}$$


In this approximation the degrees of freedom ascribed to chi-square are 
not integral. Significance levels are obtained by linear interpolation in 
the tables of chi-square. Since the 1% levels of chi-square are 21.67 and 
23.21 for 9 and 10 d.f., respectively, I is obviously significant. 
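
The conversion and the interpolation in the chi-square table can be sketched in Python as follows. The function names are hypothetical; the formula is the one stated above for the complete interactions s.s., and the two tabulated 1% levels are those quoted in the text.

```python
def interaction_chi_square(I, n, t, p):
    """Convert the weighted interactions s.s. I to an approximate chi-square.

    n: error d.f. per experiment, t: number of treatments, p: number of places.
    Returns the chi-square value and its (generally fractional) d.f.
    """
    chi2 = (n - 4) * (n - 2) / (n * (n + t - 3)) * I
    df = (p - 1) * (t - 1) * (n - 4) / (n + t - 3)
    return chi2, df


def interpolate_level(df, lower_df, lower_level, upper_df, upper_level):
    """Linear interpolation of a tabulated significance level for fractional d.f."""
    fraction = (df - lower_df) / (upper_df - lower_df)
    return lower_level + fraction * (upper_level - lower_level)


chi2, df = interaction_chi_square(I=91.67, n=12, t=5, p=5)
level_1pct = interpolate_level(df, 9, 21.67, 10, 23.21)
print(round(chi2, 2), round(df, 2), round(level_1pct, 2))
```

The computed chi-square lies far above the interpolated 1% level, in line with the conclusion in the text.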

* This test is an extension and modification of a test previously given
by Cochran (14.6).

The test can also be applied to any component of the interactions s.s.
As an example, we will test the interaction of the quadratic component
of the regression on amount of phosphorus. This was previously tested
by the F-test in table 14.5, and found non-significant. The values given
for the quadratic components in section 14.21 were derived from treatment
totals; to obtain corresponding values from means, we divide by 5.

Experiment        2         3         4         5         6       Total
Q_i             -65       -16       -14       -20       -22
w_i Q_i      -3.900    -0.512    -1.624    -0.500    -2.112      -8.648


Each quantity has variance $14\sigma_i^2/r$, and receives a weight
$w_i/14$. The sum of squares is

$$\frac{1}{14}\left[\sum w_iQ_i^2 - \frac{\left(\sum w_iQ_i\right)^2}{W}\right] = \frac{1}{14}(340.89 - 227.32) = 8.11 = I_q$$
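
As a check on the figures 340.89 and 227.32, the sum of squares for the quadratic component can be recomputed from the tabulated $Q_i$ and the weights of table 14.11. The Python sketch below uses illustrative variable names; the divisor 14 is the one quoted in the text.

```python
w = [0.060, 0.032, 0.116, 0.025, 0.096]  # weights from table 14.11
Q = [-65, -16, -14, -20, -22]            # quadratic components on a means basis
divisor = 14                             # divisor quoted in the text

W = sum(w)
sum_wQ = sum(wi * qi for wi, qi in zip(w, Q))        # total of the products w_i Q_i
sum_wQ2 = sum(wi * qi ** 2 for wi, qi in zip(w, Q))  # weighted sum of squares
I_q = (sum_wQ2 - sum_wQ ** 2 / W) / divisor

print(round(sum_wQ, 3), round(sum_wQ2, 2), round(sum_wQ ** 2 / W, 2), round(I_q, 2))
```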


The quantity $I_q$ is the sum of squares for the interaction of a single
treatment comparison with all p places, and would normally have
(p - 1) degrees of freedom. To present the general formula for conversion
to chi-square, suppose that we have computed the sum of squares for the
interaction of $t_1$ treatment comparisons with $p_1$ of the places, so
that there would normally be $t_1(p_1 - 1)$ degrees of freedom. Then

$$\chi^2 = \frac{(n-4)(n-2)}{n(n+t_1-2)}\, I, \quad \text{with} \quad \frac{t_1(p_1-1)(n-4)}{(n+t_1-2)} \ \text{d.f.}$$

This formula is in accord with that used for the whole (t - 1)(p - 1)
degrees of freedom in the interactions. Since the t treatments provide
(t - 1) independent treatment comparisons, we would take $t_1 = (t - 1)$
in applying the formula to the complete interactions s.s. For $t_1 = 1$,

$$\chi^2 = \frac{(8)(10)}{(12)(11)}\,(8.11) = 4.92, \quad \text{with} \quad \frac{(4)(8)}{(11)} = 2.91 \ \text{d.f.}$$


The tables show that this value has a probability between 0.2 and 0.1 , 
since by interpolation the 20% level for 2.91 d.f. is 4.51, the 10% level 
6.10. The earlier F-test gave a probability slightly over 0.2, so that the 
two tests are in close agreement. 

If the experiments differ in size, the weight $w_i$ becomes $r_i/s_i^2$.
The numbers of error degrees of freedom will also vary. This necessitates
a more elaborate conversion formula: as a rough approximation the average
number of error degrees of freedom per experiment is used in place of n.

The test is not recommended for low values of n, because the $s_i^2$ are
relatively poor estimates of the $\sigma_i^2$, and frequently one or two
experiments receive such high weights that they dominate the analysis.
Appropriate methods are presented by Yates and Cochran (14.5) and
Cochran (14.6).


14.5 Repetitions in Both Space and Time

As has been mentioned, agricultural field experiments are often re- 
peated both at a number of places and for a number of years. We will 
give only an introduction to the simplest case, in which experiments have 
been carried out at each of p places for y years. At any place a new site 
is chosen each year for the experiment, and a new randomization em- 
ployed, so that the data from successive years may be regarded as 
independent. The experimental arrangement need not be uniform 
throughout the whole series of experiments: in fact, a change in design 
between seasons is not uncommon. 

The mathematical representation of the mean of the jth treatment at
the ith place in the kth year is now more lengthy. In addition to the
symbols previously used for place and treatment effects, $\gamma_k$ will
denote the effect of the year. Interactions are denoted by multiple
symbols: thus $(\pi\tau)_{ij}$ is the contribution of the place ×
treatment interaction in this experiment. With this notation

$$\bar{x}_{ijk} = \mu + \pi_i + \tau_j + \gamma_k + (\pi\tau)_{ij} + (\pi\gamma)_{ik} + (\tau\gamma)_{jk} + (\pi\tau\gamma)_{ijk} + \bar{e}_{ijk}$$

Note that there are treatment X place, treatment X year, and treat- 
ment X place X year interactions. 

In experimental programs of this type, it is usually hoped that the 
places and years constitute a representative sample of the population of 
places and years to which the results will be applied. There are obvious 
practical difficulties in choosing places and years that can be confidently 
asserted to be such a representative sample, and sometimes little effort 
is made to ensure that this will be so. The hard fact is that any sta- 
tistical inferences drawn from an analysis of the data will apply only to 
the population (if one exists) of which the experiments are a random 
sample. If this population is vague and unreal, the analysis is likely to 
be a waste of time, at least from the strictly practical point of view. 

It seems appropriate to regard all the quantities in the equation as
random variables except the general mean $\mu$ and the true effects
$\tau_j$ of the treatments, because if we could tabulate all the values
of, say, $(\pi\tau)_{ij}$ in the population, they would follow some
frequency distribution from which the values in our data are a sample.
For the full application of the analysis of variance we require the
assumptions that the experimental errors and all interactions of
treatments are normally and independently distributed, with variances
that are denoted by use of the letters that enter into the interaction.

An unweighted analysis of variance of the treatment means is obtained by
the standard procedure for factorial experiments. The important parts of
this analysis are sketched in table 14.12.

TABLE 14.12 Analysis of variance of treatment means with time and
place variations

Source of variation            d.f.                     Expectation of mean square
Treatments                     (t - 1)                  $\sigma_e^2 + \sigma_{tpy}^2 + y\sigma_{tp}^2 + p\sigma_{ty}^2 + py\sum\tau_j^2/(t-1)$
Treatments × places            (t - 1)(p - 1)           $\sigma_e^2 + \sigma_{tpy}^2 + y\sigma_{tp}^2$
Treatments × years             (t - 1)(y - 1)           $\sigma_e^2 + \sigma_{tpy}^2 + p\sigma_{ty}^2$
Treatments × places × years    (t - 1)(p - 1)(y - 1)    $\sigma_e^2 + \sigma_{tpy}^2$
Pooled error                                            $\sigma_e^2$

In the most general case, $\sigma_e^2$ is the average value of
$\sigma_{ik}^2/r_{ik}$, where $\sigma_{ik}^2$ is the error variance in the
individual experiment at the ith place in the kth year, and $r_{ik}$ is
the number of replications in that experiment. Consequently, the pooled
error should be estimated as the average of $s_{ik}^2/r_{ik}$.

The method to be followed in testing the significance of the successive 
terms is made clear by the expected values of the mean square. The 
treatments X places X years interactions are tested against the pooled 
error. Both treatments X years and treatments X places are tested 
against treatments X places X years. 

The situation with regard to the test for treatments is interesting. It 
is evident that no other mean square in the analysis is suitable as a de- 
nominator of F, unless either the treatments X places or the treat- 
ments X years interactions appear to be negligible, so that the corre- 
sponding variance is assumed to be zero. If this happens, the other 
interaction m.s. is an appropriate denominator for F (subject to some 
uncertainty in case the assumption should not be correct). 

When both two-factor interactions are present, the hypothesis that
all $\tau_j$ are zero is equivalent to the hypothesis that

$$\theta_t + \theta_{tpy} = \theta_{tp} + \theta_{ty}$$

where the $\theta$'s stand for the expected mean squares in table 14.12.
No exact test for this kind of relationship is at present known. By
analogy with the F-test, one suggestion is to use the ratio

$$\frac{s_t^2 + s_{tpy}^2}{s_{tp}^2 + s_{ty}^2}$$

where the $s^2$ values are the mean squares in the analysis of variance.
This ratio does not follow the F-distribution, but following an
approximation suggested by Satterthwaite (14.7) and others, we might use
the F-tables with $n_1'$ and $n_2'$ degrees of freedom, where

$$n_1' = \frac{\left(s_t^2 + s_{tpy}^2\right)^2}{\dfrac{s_t^4}{n_t} + \dfrac{s_{tpy}^4}{n_{tpy}}}$$

where the n's are the numbers of degrees of freedom in the corresponding
mean squares. The analogous expression is used for $n_2'$.
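
A small Python sketch may make the approximate test concrete. The function name and the mean squares passed to it are hypothetical, invented here purely for illustration (they are not results from the text); only the ratio and the degrees-of-freedom formula follow the description above.

```python
def approximate_f(s_t, s_tpy, s_tp, s_ty, n_t, n_tpy, n_tp, n_ty):
    """Approximate F-ratio for treatments with Satterthwaite-type d.f.

    The s_* arguments are mean squares (i.e. s^2 values); the n_* arguments
    are the degrees of freedom of the corresponding mean squares.
    """
    ratio = (s_t + s_tpy) / (s_tp + s_ty)
    n1 = (s_t + s_tpy) ** 2 / (s_t ** 2 / n_t + s_tpy ** 2 / n_tpy)
    n2 = (s_tp + s_ty) ** 2 / (s_tp ** 2 / n_tp + s_ty ** 2 / n_ty)
    return ratio, n1, n2


# Hypothetical mean squares for t = 5 treatments, p = 6 places, y = 3 years.
ratio, n1, n2 = approximate_f(
    s_t=900.0, s_tpy=30.0, s_tp=120.0, s_ty=90.0,
    n_t=4, n_tpy=40, n_tp=20, n_ty=8,
)
print(round(ratio, 2), round(n1, 2), round(n2, 2))
```

The resulting degrees of freedom are generally fractional, so the F-tables would be entered by interpolation or with the nearest integers.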

After completion of the analysis, the next step is to examine the nature 
of the interactions and average effects of the treatments, bringing to bear 
any external knowledge that will throw light on the interpretation. 
Often, for the practical uses of the results, we would like to decide 
whether a single recommendation can be made for application in the 
whole population, or whether, on the other hand, it is necessary to have 
several recommendations for different parts of the population. These 
decisions are facilitated by calculating confidence limits for the dif- 
ferences among the means of the most likely candidates. For a discus- 
sion of methods for making this calculation, which can give rise to some 
complications, see reference (14.5).

RANDOM PERMUTATIONS OF 9 AND 16 NUMBERS

15.1 Use of the Random Permutations

In practically all the experimental designs, the randomization con- 
sists in arranging a set of objects, whether treatments, blocks, rows, or 
columns, in random order. If the number of objects does not exceed 9, 
a randomization is obtained at once from table 15.6, which contains 
1000 random arrangements of the numbers from 1 to 9. For example, 
to arrange 7 treatments in random order, select a starting place in table 
15.6 (without inspection of the numbers in the table). Suppose that the 
permutation chosen is 

1 8 9 2 6 7 5 3 4

Omitting the digits 8 and 9, we obtain

1 2 6 7 5 3 4

for the desired arrangement. Similarly, table 15.7, which contains 1000 
permutations of the numbers between 1 and 16, may be used for any 
number of objects between 2 and 16. 

Occasionally the randomization involves dividing the objects into 
groups, as in the completely randomized and cross-over designs. Thus, 
with a cross-over design (section 4.4) having 2 treatments and 10 blocks,
treatment A must appear in the first row in five of the blocks, chosen 
at random, and in the second row in the remaining five blocks. Select a 
random permutation of the numbers up to 16 from table 15.7, say 

6 16 10 5 12 7 9 4 1 2 15 14 13 11 8 3 

The numbers from 11 to 16 are ignored. The first five of the remaining
numbers, 6, 10, 5, 7, and 9, are chosen as the blocks in which A appears
in the first row.
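
Both uses of a tabulated permutation amount to discarding the numbers that exceed the number of objects. A minimal Python sketch, with a helper name chosen here for illustration, reproduces the two examples above.

```python
def arrange(permutation, k):
    """Keep only the numbers 1..k, preserving their order in the permutation."""
    return [x for x in permutation if x <= k]


# Seven treatments from a permutation of 9.
order = arrange([1, 8, 9, 2, 6, 7, 5, 3, 4], 7)

# Ten blocks from a permutation of 16; the first five receive A in the first row.
blocks = arrange([6, 16, 10, 5, 12, 7, 9, 4, 1, 2, 15, 14, 13, 11, 8, 3], 10)
first_row_a = blocks[:5]

print(order, first_row_a)
```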

If the number of objects to be randomized exceeds 16, see section 15.3. 

15.2 Construction of the Random Permutations

The idea of using random permutations instead of the ordinary random
digits was suggested by Professor George W. Snedecor. Before the
writing of this book, 200 random permutations of each of the numbers
from 3 to 13 had been made at the Statistical Laboratory at Iowa State 
College. These were obtained by first constructing a large number of 
ordinary random digits, which were then transformed into random per- 
mutations. The method of transformation was the usual one: it may be 
illustrated for the numbers from 1 to 11. To obtain a random per- 
mutation of the numbers 1 to 11, a series of pairs of random digits is 
taken, say 

65, 04, 29, 82, 37, 52, 53, etc.

When we divide each number by 11 the remainders are 

10, 4, 7, 5, 4 (omitted), 8, 9, etc. 

This process is continued until 10 of the 11 numbers have appeared, 
repetitions being omitted. 
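
The method of division can be sketched in Python as below. Two details are assumptions made here and are not stated in the text: a remainder of 0 is read as the number n itself, and two-digit values at or above the largest multiple of n below 100 are discarded so that all remainders stay equally likely. The function name is illustrative.

```python
def permutation_by_division(two_digit_numbers, n):
    """Build a permutation of 1..n from remainders on division by n.

    Assumptions (not from the text): remainder 0 stands for n, and values
    >= the largest multiple of n below 100 are skipped to avoid bias.
    Repetitions are omitted; once n - 1 numbers have appeared the last
    one is determined and is appended.
    """
    limit = (100 // n) * n
    seen = []
    for value in two_digit_numbers:
        if value >= limit:
            continue
        remainder = value % n or n
        if remainder not in seen:
            seen.append(remainder)
        if len(seen) == n - 1:
            missing = (set(range(1, n + 1)) - set(seen)).pop()
            return seen + [missing]
    raise ValueError("not enough random numbers supplied")


# Pairs 04, 29, 82, 37, 52, 53 from the example, followed by made-up pairs.
perm = permutation_by_division([4, 29, 82, 37, 52, 53, 1, 2, 3, 6, 21], 11)
print(perm)
```

The second 4 (from the pair 37) is skipped as a repetition, as in the worked example.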

Rather than present 200 permutations of each of a series of numbers, 
it was decided to concentrate on two numbers. This decision was made 
partly because any permutation of 9 numbers, say, can be used to give 
permutations of all numbers less than 9. Thus 1000 permutations of 9 
numbers are more useful than 200 of each of the numbers 9, 8, 7, 6, and 
5. Also, certain numbers, such as 11 and 13, are less often used in ex- 
periments than others. The numbers 9 and 16 were selected for presen- 
tation because of their occurrence in factorial experiments. 

The 1000 permutations of 9 were obtained from the permutations 
already available for 9, 10, 11, 12, and 13, by omitting the numbers 
above 9. Of the permutations of 16, 800 were obtained from random 
digits. With as many as 16 numbers to permute, the method of division 
as illustrated above is rather slow. A more rapid alternative, suggested 
by Mr. Paul Peach, was used. In this method 16 pairs of random digits
are taken, for example,

75 10 17 28 91 85 74 56 71 06 26 01 06 21 07 60

14  5  6  9 16 15 13 10 12  3  8  1  2  7  4 11

The permutation is produced by ranking the pairs in order of size, as
shown below each pair. In the event of a tie, as happens in this example
with the two 06's, the two ranks in question may be allotted from an
additional random digit. Ties occur frequently (the probability that all
16 pairs are distinct is about .35) but cause little delay. Although the 
method has the additional merit of using fewer random digits than the 
method of division, with 32 random digits per permutation a large supply 
of random digits is required. The remaining 200 permutations were ob- 
tained by drawing numbered marbles from an urn. 
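
The ranking method is easy to express in Python. In this sketch ties are broken by attaching an extra random number to each pair, which stands in for the "additional random digit" of the text; the function name and the seed are arbitrary choices made here.

```python
import random


def permutation_by_ranking(values, rng):
    """Rank the values in order of size; tied values are ordered at random."""
    keys = [(v, rng.random()) for v in values]       # random tie-breaker
    order = sorted(range(len(values)), key=lambda i: keys[i])
    ranks = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        ranks[index] = rank
    return ranks


pairs = [75, 10, 17, 28, 91, 85, 74, 56, 71, 6, 26, 1, 6, 21, 7, 60]
ranks = permutation_by_ranking(pairs, random.Random(1))
print(ranks)
```

Apart from the two tied 06's, whose ranks 2 and 3 may come out in either order, the ranks agree with those shown beneath the pairs in the example.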
15.3 Randomization of More than 16 Numbers

The most suitable method depends on the facilities available and on 
the frequency with which randomizations have to be made. Navy beans, 
drawn from a box, have been found quite expeditious and convenient, 
though the numbers on the beans rub off with repeated usage.
Alternatively one of the standard sets of random digits, references
(15.1-15.4), may be used. For example, with 25 numbers to be placed in
random order, choose a starting place in the table and select the next 25
sets of three-digit random numbers. These numbers are then placed in 
increasing order by the method used in the previous section for the per- 
mutations of 16. Three-digit random numbers are taken instead of two- 
digit numbers in order to avoid ties. With three-digit numbers the fre- 
quency of ties is negligible even if as many as 100 numbers are to be 
randomized. If punched card machines are available, the numbers 1 to 
25 may be punched in columns 1 and 2 and the random digits in col- 
umns 3, 4, and 5 of the card. The permutation may then be produced 
and printed by sorting on columns 3 to 5 . By filling all the columns from 
3 onwards with random digits, a number of permutations can be obtained 
by sorting on different sets of columns, though the number is limited if 
the permutations are to be kept independent of each other. 

15.4 Tests of Randomness 

Many tests of different aspects of the randomness of the permutations
could be made. So far as their use in experimental design is concerned,
the test most immediately relevant is a test of the null hypothesis that
the numbers 1, 2, ..., 9 have an equal probability of appearing in the
1st, 2nd, ..., 9th positions. This test is made by counting the number
of times that each number has occurred in each position. The results
are shown in two-way presentations in tables 15.1 and 15.2.

TABLE 15.1 Number of occurrences of 1, 2, ..., 9 in 1st, 2nd, ...,
9th position in 1000 permutations of 9

                                  Position
Number      1     2     3     4     5     6     7     8     9   Totals
1         130   111   111   108    94   121   114   106   105    1000
2         120   118   104   102   102   133    98   108   115    1000
3          92   100   116   111   113   110   116   123   119    1000
4         121    98    97   108   117   112   124   102   121    1000
5          95   103   120   140   122    84   113   113   110    1000
6         107   118   118    88   113   120   124   111   101    1000
7         108   119   121   124   110    94    95   113   116    1000
8         115   116   110    98   107   109   117   121   107    1000
9         112   117   103   121   122   117    99   103   106    1000
Totals   1000  1000  1000  1000  1000  1000  1000  1000  1000

TABLE 15.2 Number of occurrences of 1, 2, ..., 16 in 1st, 2nd, ..., 16th position in 1000 permutations of 16

For the permutations of 9, the expectation in each cell is 1000/9, or
111.1. Each row and each column in table 15.1 adds to 1000. Taking
the values in a single column, we may test by $\chi^2$ whether the 9
numbers are equally represented in the corresponding position. If the x's
are the observed entries in the column,

$$\chi^2 = \sum \frac{(x - m)^2}{m}, \qquad m = \frac{1000}{9}$$

For computational purposes, this simplifies to

$$\chi^2 = \tfrac{9}{1000}\sum (x^2) - 1000$$

Thus, for the first position,

$$\chi^2 = \tfrac{9}{1000}\left(130^2 + 120^2 + \cdots + 112^2\right) - 1000 = 10.81$$

with 8 d.f. Similarly, from the rows we may test whether any given
number has appeared equally often, apart from sampling fluctuations,
in each position. Although the two sets of tests are not independent,
both are of interest. The values of $\chi^2$ are shown in table 15.3.
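
The position test for the first column of table 15.1 can be verified with the short Python sketch below, which evaluates both the defining form and the computational form of $\chi^2$. Variable names are illustrative.

```python
# First column of table 15.1: occurrences of the numbers 1..9 in position 1.
observed = [130, 120, 92, 121, 95, 107, 108, 115, 112]

total = sum(observed)           # 1000 permutations
m = total / len(observed)       # expected count per cell, 1000/9

# Defining form: sum of (x - m)^2 / m over the column.
chi2_definition = sum((x - m) ** 2 for x in observed) / m

# Computational form: (9/1000) * sum(x^2) - 1000.
chi2_shortcut = len(observed) / total * sum(x * x for x in observed) - total

print(round(chi2_definition, 2), round(chi2_shortcut, 2))
```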


TABLE 15.3 Values of $\chi^2$ from 1000 permutations of 9

Position                Number
test       $\chi^2$     test       $\chi^2$
1           10.81       1            7.46
2            5.19       2            9.17
3            5.08       3            6.70
4           17.16       4            7.75
5            6.06       5           18.90
6           15.52       6            8.97
7            8.82       7            8.25
8            3.88       8            3.45
9            3.44       9            5.32
Totals      75.96                   75.97


In the position tests, the individual probability values range from
.90 ($\chi^2$ = 3.44) to .028 ($\chi^2$ = 17.16). With 8 d.f., the
probability that the largest of 9 independent values of $\chi^2$ should
exceed 17.16 works out at .23. Actually, owing to the fact that the cell
numbers add to 1000 in any row or column, the 9 values for the position
$\chi^2$ have a small positive correlation which reduces the probability
value for the largest $\chi^2$ to about .20. The individual probability
values for the number tests all lie between .90 ($\chi^2$ = 3.45) and
.015 ($\chi^2$ = 18.90). The probability that the largest $\chi^2$ should
exceed 18.90 is about 0.11. Thus neither series of tests indicates any
marked departure from randomness.

The total, 75.97, which is $\sum (x - m)^2/m$ taken over all cells in
table 15.1, supplies a composite test of the complete table. On account of
the positive correlation referred to in the previous paragraph, the value
75.97 does not itself follow a $\chi^2$ distribution. In fact, it may be
shown to be distributed approximately as (9/8)$\chi^2$, where $\chi^2$ has
64 d.f. Consequently we may test (8)(75.97)/9, or 67.52, as a $\chi^2$
value with 64 d.f. As would be expected from the results of the previous
tests, the probability value, .36, provides no evidence for the rejection
of the null hypothesis.

The corresponding $\chi^2$ values for permutations of 16 are given in
table 15.4.

TABLE 15.4 Values of $\chi^2$ from 1000 permutations of 16

Position                Number
test       $\chi^2$     test       $\chi^2$
1           14.40       1           20.25
2           17.70       2           12.96
3           11.87       3           16.58
4           15.14       4            4.80
5           17.70       5           14.94
6            7.42       6            9.57
7           12.29       7           13.98
8            7.42       8           13.63
9           24.54       9           15.55
10           9.15       10          12.45
11          10.62       11          14.05
12          16.51       12          18.56
13           9.92       13          14.50
14          18.85       14          13.06
15           6.98       15          14.78
16          19.36       16          10.21
Totals     219.87                  219.87


The only figure in the group which has an unusual probability is the
low value 4.80 for number 4. This has an individual probability of .9932
(15 d.f.). When account is taken of the fact that this is the smallest of
the 16 values for the number tests, the probability is found to be .10.
The total, 219.87, is distributed as (16/15)$\chi^2$ with 225 d.f. and is
found to be slightly but not abnormally below its expectation.

A further test of randomness was made by counting the number of
inversions for each permutation. The inversions are counted as follows.
For each number, record how many numbers to the left are greater than
the number. Thus for the permutation

3 4 8 1 9 7 2 6 5

we record

0 0 0 3 0 2 5 3 4

The total (3 + 2 + 5 + 3 + 4 = 17) gives the number of inversions.
The extreme values for the number of inversions are zero (for the
permutation 1 2 3 4 5 6 7 8 9) and 36 (for the permutation
9 8 7 6 5 4 3 2 1).
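
The counting rule translates directly into Python. The sketch below (function name chosen here) returns both the per-number record and the total, and reproduces the worked example and the two extreme cases.

```python
def count_inversions(permutation):
    """For each number, count the larger numbers standing to its left."""
    per_number = [
        sum(1 for left in permutation[:i] if left > x)
        for i, x in enumerate(permutation)
    ]
    return per_number, sum(per_number)


per_number, total = count_inversions([3, 4, 8, 1, 9, 7, 2, 6, 5])
print(per_number, total)

# Extreme cases: natural order and reverse order of 9 numbers.
_, lowest = count_inversions(list(range(1, 10)))
_, highest = count_inversions(list(range(9, 0, -1)))
print(lowest, highest)
```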


In intermediate cases the number of inversions serves as a measure of
the extent to which the order is inverted from the natural order towards
the reverse order. The frequency distribution of the number of
inversions has been studied by Rosander (15.5) and Kendall (15.6), who
derived from the number a measure of the rank correlation between two
series of figures. For random permutations of 1 to 10 numbers, Kendall
also tabulated the exact distribution of the number of inversions. His
distribution was extended to 16 figures for the test given here.

For permutations of 9 the observed and theoretical distributions are
shown in table 15.5.

TABLE 15.5 Observed and theoretical frequencies of numbers of
inversions for permutations of 9

Number of      Observed      Theoretical     Goodness
inversions     frequency     frequency       of fit, $\chi^2$
0-6                8             6.3            0.46
7                  4             6.0            0.67
8                 11             9.9            0.12
9                 10            15.3            1.84
10                13            22.1            3.75
11                36            30.4            1.03
12                38            39.7            0.07
13                50            49.5            0.01
14                71            59.1            2.40
15                65            67.8            0.12
16                80            74.6            0.39
17                71            79.0            0.81
18                90            80.6            1.10
19                75            79.0            0.20
20                80            74.6            0.39
21                57            67.8            1.72
22                58            59.1            0.02
23                45            49.5            0.41
24                40            39.7            0.00
25                27            30.4            0.38
26                27            22.1            1.09
27                18            15.3            0.48
28                 7             9.9            0.85
29                11             6.0            4.17
30-36              8             6.3            0.46
Totals          1000          1000.0           22.94

The total $\chi^2$, 22.94, has 24 d.f. and agrees closely with
expectation. The corresponding value for permutations of 16 is
$\chi^2$ = 47.52 with 54 d.f. and a probability value of .72.
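
The theoretical frequencies of the number of inversions can be generated by a standard recurrence: the m-th object, when inserted into a permutation of m - 1 objects, adds between 0 and m - 1 inversions with equal probability. The Python sketch below uses that recurrence; it is offered as one way of obtaining the distribution and is not claimed to be the procedure Kendall used for his tables.

```python
from math import factorial


def inversion_counts(n):
    """Number of permutations of n objects with k inversions, k = 0..n(n-1)/2."""
    counts = [1]  # a single object: one permutation, no inversions
    for m in range(2, n + 1):
        new = [0] * (len(counts) + m - 1)
        for k, c in enumerate(counts):
            for extra in range(m):  # the m-th object adds 0..m-1 inversions
                new[k + extra] += c
        counts = new
    return counts


counts = inversion_counts(9)
per_thousand = [1000 * c / factorial(9) for c in counts]

# Grouped end class (0-6 inversions) and two single classes, per 1000 permutations.
print(round(sum(per_thousand[:7]), 1), round(per_thousand[7], 1), round(per_thousand[17], 1))
```

The frequencies obtained in this way agree with the theoretical column of table 15.5 to within rounding.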
TABLES OF RANDOM PERMUTATIONS
TABLE 15.6 Permutations of 9 (Continued)


15.5 


8 6 2 2 4 

2 2 3 3 9 
7 5 6 9 5 

5 4 4 5 3 

6 8 18 2 
19 9 11 

9 18 4 8 

3 3 5 6 6 

4 7 7 7 7 


6 5 3 4 8 

2 3 7 5 9 

7 4 9 8 2 
5 2 2 9 6 
4 7 5 1 7 

8 9 4 3 3 
116 2 1 

3 8 16 4 

9 6 8 7 5 


7 4 2 4 2 

8 5 9 8 7 

9 6 4 5 9 

1 7 7 6 3 

2 2 8 7 5 
5 9 12 1 
6 3 6 1 6 
4 15 9 8 

3 8 3 3 4 


9 7 5 9 7 

2 5 7 7 5 

7 2 3 3 9 

4 4 8 5 2 

5 3 9 8 8 

6 6 14 6 

3 9 4 6 4 

8 8 2 2 1 
116 13 


2 5 0 6 5 

8 115 7 

4 9 2 3 9 
7 8 4 8 1 

5 4 5 7 2 

3 2 3 9 6 
1 6 9 2 3 

6 3 7 1 8 

9 7 8 4 4 


5 4 5 8 3 

8 1 8 3 7 
4 2 6 6 1 
7 5 2 2 4 

9 0 14 5 
3 9 4 1 9 

1 7 9 7 6 

2 8 7 5 2 

6 3 3 9 8 


9 5 3 8 2 

6 17 5 4 

4 7 2 1 9 

7 2 5 9 5 

2 9 8 6 8 

3 6 12 7 

8 4 6 4 1 
1 3 9 3 3 

5 8 4 7 1 


1 2 2 6 2 

8 3 9 2 3 

3 13 7 5 
0 6 7 4 8 
5 9 15 9 

4 8 5 1 6 
7 7 8 9 4 
2 5 4 3 7 

9 4 6 8 1 


3 2 3 5 8 

2 6 6 7 2 

5 17 4 4 

4 9 5 1 9 

1 5 2 9 6 
9 3 12 7 

7 4 9 3 5 

6 7 8 6 3 

8 8 4 8 1 

4 2 8 3 2 

6 5 7 1 8 

3 6 6 6 6 

9 4 4 9 3 

2 3 9 4 5 
18 17 7 

5 7 3 2 9 
8 9 2 8 4 

7 15 5 1 


9 6 5 2 2 

6 5 13 8 

7 4 4 9 6 
5 9 9 8 4 
4 7 8 6 9 
3 3 3 7 5 
2 17 4 3 

8 2 6 1 7 
1 8 2 5 1 


1 6 4 4 6 

8 4 5 9 1 

6 7 2 8 7 

3 2 6 1 8 
5 9 9 2 5 

2 8 8 6 9 

9 113 2 

7 5 3 5 4 

4 3 7 7 3 


8 6 5 2 2 

116 5 9 

7 4 2 8 8 

9 7 16 7 

5 5 9 4 6 

6 9 3 9 4 

3 8 4 1 3 

4 3 8 3 5 
2 2 7 7 1 

117 6 1 

2 7 8 1 3 

5 3 2 7 6 

8 2 4 5 5 

6 4 5 9 4 
4 8 9 4 2 

9 9 6 2 8 

7 6 18 7 

3 5 3 3 9 


4 8 6 2 6 

7 7 2 7 8 
3 4 3 4 7 

5 6 8 3 5 
9 9 4 8 2 

6 5 16 9 
2 3 7 1 1 
119 9 3 

8 2 5 5 4 


4 9 8 2 9 
3 6 9 1 6 

1 3 3 7 5 

5 8 5 9 4 

7 4 2 8 2 

6 5 14 1 

8 2 7 3 7 

2 7 4 6 3 

9 16 5 8 


8 4 9 9 1 

3 7 18 5 

5 5 6 6 2 

6 2 4 3 9 

9 3 2 2 8 

4 8 7 7 4 
2 18 5 7 
19 3 16 

7 6 5 4 3 


6 18 4 7 

7 7 2 9 3 
9 9 4 5 8 

1 5 7 8 5 

2 3 3 6 4 

3 4 9 2 9 
5 6 5 1 1 

8 2 6 3 6 

4 8 17 2 


2 2 6 6 3 

4 8 8 5 9 

7 3 2 3 2 
9 9 9 4 5 

5 5 5 7 7 

6 7 12 8 

3 17 8 4 

8 6 4 9 1 
14 3 16 

4 2 4 1 3 
3 116 8 

5 8 2 3 5 

1 9 3 9 6 

9 3 5 4 9 

7 5 6 2 4 

6 7 7 7 1 

2 4 9 5 7 

8 6 8 8 2 


8 6 6 5 6 

3 7 4 3 5 

9 4 8 8 8 

6 12 7 7 
2 5 5 4 4 

4 8 9 1 1 

7 9 16 9 

5 3 7 9 2 
1 2 3 2 3 


4 3 3 7 9 

7 19 13 

8 8 4 3 7 
6 5 6 2 1 

2 9 2 4 8 
1 2 5 9 4 

3 7 8 6 2 

5 6 15 5 

9 4 7 8 6 


12 18 3 

5 4 4 7 8 

2 9 6 1 7 

6 5 2 4 6 

7 6 5 3 4 
9 19 2 1 

8 8 8 9 2 
4 7 7 6 9 

3 3 3 5 5 


7 3 8 2 7 

8 19 8 9 

3 8 6 6 5 

1 4 3 4 6 

9 6 2 9 1 

4 7 7 7 3 
6 2 5 5 4 

2 9 4 1 8 

5 5 13 2 


9 8 19 7 

3 7 2 6 5 
8 16 7 1 

6 2 8 3 2 

1 5 3 4 4 

7 6 7 8 8 
5 3 5 5 6 

2 4 9 2 9 

4 9 4 1 3 


3 5 6 6 5 
9 3 2 5 2 

5 4 18 7 
115 4 9 

4 8 8 1 3 
2 9 9 3 8 

6 6 4 2 6 
8 7 7 9 1 

7 2 3 7 4 


9 3 4 5 9 

6 9 8 1 5 

2 5 7 3 1 

3 4 5 6 3 

4 2 G 4 2 

7 6 12 7 

5 8 2 9 6 

8 7 3 8 4 
119 7 8 


9 7 7 4 5 

3 4 117 

4 6 3 3 6 

6 2 5 9 8 
2 5 6 7 4 

7 9 2 6 1 

5 3 4 5 2 

8 8 9 2 3 
118 8 9 


4 17 6 7 

7 9 3 8 8 

8 3 8 2 6 

9 5 6 9 3 
6 8 4 4 2 

5 4 9 3 9 
16 17 4 
3 7 5 5 1 
2 2 2 1 5 


4 7 4 9 6 

8 5 6 7 2 
3 2 8 2 4 

5 3 16 9 

6 4 9 4 8 
115 8 3 

9 8 3 3 1 
2 6 2 1 7 

7 9 7 5 5 




113 9 1 

5 8 13 5 
2 2 8 8 2 
4 4 5 7 9 
9 7 G 6 8 
8 6 2 5 7 

6 5 7 2 4 
3 3 4 1 3 

7 9 9 4 6 


3 2 5 6 1 

6 4 4 9 2 
5 5 3 2 5 
9 3 2 4 4 

7 9 7 5 6 
2 8 6 7 3 

4 7 13 7 
119 18 

8 0 8 8 9 


4 6 14 7 

6 5 8 1 6 
9 8 3 7 8 

1 4 5 9 2 

7 19 5 4 
3 3 2 2 9 

5 7 7 8 5 

2 2 4 6 1 

8 9 6 3 3 


8 9 4 3 7 

9 2 19 2 

5 6 2 8 1 
7 8 8 4 9 

3 7 7 6 8 
116 15 

4 4 9 2 6 
2 3 3 7 4 

6 5 5 5 3 


13 4 14 

4 19 5 9 
3 2 8 7 3 

5 4 7 6 5 
2 5 3 4 1 

6 7 13 6 

7 6 6 9 8 
9 8 5 2 7 

8 9 2 8 2 



TABLE 15.6 Permutations of 9 (Continued) 







7 3 6 

5 3 

3 

2 

S 

9 

6 

3 

1 

4 6 

8 

8 

4 6 

8 

6 

6 

3 

7 

1 

2 

5 

8 

5 3 

5 

1 

8 

4 

7 

5 

5 6 4 

2 

6 

9 

1 

5 

3 

7 

2 

2 

2 7 

4 

6 

6 3 

6 

4 

7 

8 

3 

6 

8 

9 

4 

9 7 

9 

7 

2 

9 

5 

7 

4 

8 8 

1 

5 

4 

9 

7 

5 

9 

1 

6 

9 1 

2 

2 

9 1 

9 

9 

4 

6 

2 

9 

3 

8 

7 

4 2 

S 

9 

4 

5 

2 

3 

3 

7 3 

4 

8 

5 

5 

3 

2 

1 

S 

8 

5 5 

9 

5 

1 5 

2 

2 

3 

7 5 

7 

4 

6 

1 

1 1 

4 

5 

7 

8 

4 

8 

8 

9 5 

0 

7 

1 

4 

9 

1 

2 

6 

4 

6 8 

7 

3 3 9 

7 

1 

5 

9 

1 

5 

5 

1 

3 

6 4 

6 

3 

3 

6 

3 

9 

2 

6 9 

8 

1 

2 

8 

3 

6 

3 

4 

7 8 2 

1 

4 

8 4 

1 

3 

.1 

4 

4 

3 

7 

7 

9 

2 8 

7 

4 

9 

2 

6 

1 

6 

4 2 

3 

2 

7 7 

6 

S 

5 

5 

3 

3 9 

3 

7 2 2 

3 

5 

8 

2 

9 

4 

6 

3 

2 

3 6 

2 

6 

5 

1 

9 

4 

0 

1 1 

7 9 

6 

3 

4 

4 

8 

9 

5 

7 4 

5 

1 

5 8 

5 

8 

2 

1 

8 2 9 

2 

6 

7 5 

1 

2 

6 

3 

8 

2 

1 

2 7 

6 

4 

8 6 

2 

7 

4 

7 

9 

1 3 

6 

9 

7 7 

4 

7 

9 

5 

6 

8 

1 

4 

5 

8 9 

3 

8 

1 

7 

1 

6 

4 

7 7 

1 

2 

‘4 

4 

4 

2 

9 

8 

7 4 5 

5 

7 

1 1 

9 

1 

2 

2 

4 

1 

7 

8 

8 

2 7 

1 

4 

7 

6 

9 

8 

9 

4 2 

2 

4 

2 

1 

6 

6 

4 

5 

2 

3 3 

7 

4 

3 5 

8 

9 

9 

7 

7 

9 

3 

2 

6 

8 4 

8 

S 

9 

l 

8 

1 

3 

3 8 

3 

6 

5 

3 

9 

8 

8 

4 

3 

1 7 

8 

8 

4 2 

6 

8 

7 

5 

S 

3 

8 

4 

5 

7 9 

2 

7 

4 

7 

5 

2 

1 

8 3 

4 

7 

8 

7 

2 

3 

1 

9 

5 8 2 

3 

2 

5 3 

7 4 

3 

9 

9 

5 

6 

5 

3 

1 3 

4 

5 

3 

5 

2 

3 

5 

3 9 

9 

8 

9 

6 

5 

9 

6 

2 

8 

2 4 

4 

1 

7 8 

4 

2 

5 

8 

6 

8 

4 

6 

2 

5 6 

6 

2 

2 

9 

1 

4 

6 

2 5 

7 

1 

6 

8 

8 

4 

7 

3 

1 

6 9 

6 

9 

6 9 

2 

7 

8 

0 

5 

4 

9 

9 

4 

0 5 

5 

9 

5 

2 

6 

9 

2 

6 1 

8 

9 

3 

5 

3 

5 

3 

7 

4 

7 1 

2 

3 8 6 

3 

6 

4 

4 

1 

6 

1 

7 

9 

3 1 

9 

6 

8 

4 

7 

5 

7 9 6 

5 

5 

7 

2 

1 

1 

2 

6 

9 

5 6 

1 

5 

9 4 

1 

3 

6 

i 

3 

7 

5 

3 

7 

6 8 

7 

1 

6 

3 

3 

6 

8 5 4 

6 

3 

1 

9 

7 

7 

5 

1 

6 

9 8 

9 

6 

2 7 

5 

5 

1 

3 

2 

2 

2 

1 

1 

4 2 

3 

3 

1 

8 

4 

7 

7 4 7 8 

5 

3 

5 

9 

5 

7 

6 

7 4 1 

3 

2 

2 2 

8 

4 

4 

2 

2 

2 

5 

4 

4 

4 1 

7 

1 

1 

5 

l 

7 

1 

7 5 

4 

6 

5 

4 

1 

7 

6 

8 

8 3 8 

5 

5 

1 9 

5 

6 

3 

7 

7 

5 

1 

5 

8 

6 9 

4 

8 

5 

1 

3 

3 

2 

6 1 

1 

9 

8 

8 

8 

3 

3 

2 

6 

5 9 

6 

9 

5 8 

4 

5 

6 

4 

6 

8 

3 

1 

9 

7 2 

1 

4 

3 

2 

6 

4 

6 

3 6 

2 

7 

7 7 4 

1 

5 

9 

3 

1 3 

1 

3 

3 1 

7 

1 

9 

5 

1 

3 

7 

9 

5 

9 5 

3 

6 

9 

S 

5 

6 

3 

5 4 

9 

8 

2 

6 

6 

6 

9 

4 

2 

9 5 

7 

6 

7 7 

3 

7 

5 

6 

3 

9 

2 

6 

3 

2 7 

2 

5 

8 

7 

9 

1 

4 

8 9 

0 

2 

9 

9 

7 

8 

2 

3 

9 

7 7 

9 

1 

6 5 

9 

9 

2 

1 

5 

0 
1 6 

8 

3 
6 

2 

5 
4 
4 

1 

7 
4 
8 
9 
3 

4 

7 

4 5 

3 
2 
4 

5 

8 8 
5 
7 
5 

3 

3 8 

8 

2 

6 8 
4 
9 

6 

5 

1 
2 

4 

7 3 

3 

3 

8 3 2 5 8 

6 6 2 4 4 

1 

1 
8 9 6 9 

4 

4 

5 

5 

8 

2 

9 

4 

2 8 

1 

3 

4 

7 5 
TABLE 15.6 Permutations of 9 (Continued) 


9 8 1 

1 9 

4 2 2 

9 3 

7 1 9 

2 6 

l 7 4 

5 5 

6 6 8 

3 4 

2 9 6 

6 2 

3 5 3 

4 1 

8 4 7 8 7 

5 3 5 

7 S 

2 4 8 

1 4 

5 2 4 

9 8 

7 1 6 

7 5 

9 8 5 

S 1 

3 7 1 

3 7 

6 3 2 

2 0 

8 6 3 

4 9 

1 9 7 

5 2 

4 5 9 

6 3 

3 5 9 

5 1 

6 2 7 3 7 

9 3 2 

8 9 

5 7 6 

9 3 

7 4 8 

6 2 

1 9 1 

7 4 

8 6 5 

1 8 

4 8 4 

2 6 

2 1 3 

4 5 

4 6 6 

2 2 

3 2 2 

3 4 

2 5 5 

5 3 

1 4 8 

7 1 

9 1 4 

8 5 

5 7 9 

9 7 

7 8 3 

6 6 

6 3 1 

4 8 

8 9 7 

1 9 

8 5 3 

9 2 

7 2 5 

7 5 

5 14 

5 7 

9 6 7 

2 4 

4 8 6 

8 3 

6 4 9 

6 8 

2 7 8 

4 6 

19 2 

1 1 

3 3 1 

3 9 


4 7 6 3 4 

6 2 7 8 1 

1 9 5 6 3 

5 8 8 5 7 

2 5 2 4 5 

8 3 19 6 

7 4 9 1 8 

9 6 4 7 9 

3 13 2 2 

9 9 9 5 2 

8 7 3 8 3 
7 8 5 3 6 

3 3 16 4 

4 5 7 7 5 

1 6 6 9 1 

6 4 8 4 9 

2 2 2 1 7 

5 14 2 8 

4 18 8 6 
2 8 6 2 2 

5 3 4 9 1 

7 5 7 4 3 

8 7 5 1 5 

3 9 2 5 9 

6 6 16 8 

1 4 9 3 7 

9 2 3 7 4 

2 2 5 6 5 

7 7 9 7 9 

6 6 12 7 

3 3 8 1 3 

9 8 3 8 4 
1 5 6 5 6 

8 4 2 9 2 

4 9 4 3 8 

5 17 4 1 

1 7 9 9 6 

9 9 4 7 7 

7 2 3 4 1 

4 6 8 1 5 

2 8 6 2 4 

5 118 3 

6 4 5 6 9 

3 5 2 3 2 

8 3 7 5 8 


6 2 12 8 

3 9 6 3 7 

5 8 8 7 3 
117 6 4 
2 7 2 8 5 
9 3 5 1 6 

4 4 9 4 9 

7 6 4 9 1 

8 5 3 5 2 

5 6 3 7 8 

2 2 13 1 

7 3 4 9 5 

4 9 7 6 9 

9 7 9 1 3 

3 8 5 8 6 

6 16 2 2 

8 5 8 4 7 

1 4 2 5 4 

6 5 4 5 5 

9 7 7 7 4 

2 13 9 7 

4 9 6 6 1 

5 4 2 4 6 

3 3 113 

1 2 8 3 8 
8 8 5 2 2 

7 6 9 8 9 

5 7 5 6 4 

6 12 9 1 

8 9 6 5 6 

4 8 8 7 9 

2 3 4 4 8 

3 6 18 5 

7 4 7 2 2 

1 5 9 3 7 

9 2 3 1 3 

5 8 8 8 5 
9 1117 

7 2 3 9 4 

2 3 9 3 1 

4 6 5 7 8 

6 4 7 6 3 

8 5 2 5 6 
1 9 4 4 9 

3 7 6 2 2 


7 4 8 2 4 

5 6 9 4 5 

4 16 11 

1 9 4 5 2 

2 5 2 9 9 

3 2 7 7 7 
9 3 18 8 

6 8 5 6 3 

8 7 8 3 6 

3 2 3 8 1 

6 9 9 1 9 

2 7 7 7 8 

8 6 2 5 7 

5 14 3 5 

9 3 12 2 

1 5 6 4 4 

4 8 8 9 3 

7 4 5 6 6 

3 9 8 0 3 
9 4 3 9 9 
16 2 12 

7 7 7 4 7 

4 3 5 8 5 

8 2 15 1 

2 5 9 7 4 

6 8 0 2 8 

5 14 3 6 

6 2 7 1 6 
19 14 3 

4 7 9 3 1 

3 3 2 2 9 

7 5 5 8 2 

5 6 8 6 4 

2 8 6 5 7 

8 14 9 8 

9 4 3 7 5 

3 8 2 4 7 

9 3 8 5 6 

4 7 9 1 9 

7 5 7 8 5 

5 2 16 8 

1 9 3 3 2 

8 14 7 1 

2 6 6 2 4 

6 4 5 9 3 


2 0 3 1 6 

9 3 0 6 1 

12 19 4 

5 7 9 7 5 

7 17 8 2 

6 4 8 4 3 

8 5 4 2 8 

3 8 2 5 9 

4 9 5 3 7 

2 114 8 

1 5 4 6 f> 

7 7 6 2 2 

8 8 2 5 5 

9 3 9 1 3 

5 4 8 7 4 

6 2 3 3 1 

4 6 5 9 7 

3 9 7 8 9 

8 6 8 9 1 

6 5 6 1 5 

9 8 5 3 2 

5 4 3 5 8 

1 9 7 7 9 

4 7 9 2 4 

3 114 7 

7 3 4 8 3 

2 2 2 6 6 

4 8 3 4 0 

1 9 2 2 2 

8 4 5 7 7 

3 7 7 6 9 

9 5 118 

2 18 8 4 

7 6 6 9 3 

5 2 4 5 5 

6 3 9 3 1 

8 4 13 8 

7 7 3 4 7 

6 2 5 1 9 

9 5 7 9 4 

119 8 3 

3 3 6 2 2 

4 9 4 6 1 

5 8 2 5 6 

2 6 8 7 5 


6 9 9 6 7 

3 5 15 3 

2 4 2 2 8 

4 7 8 1 5 

8 8 6 7 9 

9 2 5 8 1 

7 17 4 2 

1 6 3 9 6 

5 3 4 3 4 

9 7 2 9 7 

3 6 8 6 0 
5 5 15 9 

7 2 9 2 8 
14 3 12 

2 9 5 8 1 

0 3 6 4 6 

4 8 7 7 4 

8 14 3 3 

2 6 5 3 1 

7 13 8 5 

1 2 4 6 3 

4 4 12 4 

8 7 8 5 9 

5 9 6 4 2 

9 3 9 9 8 

3 8 2 1 7 

6 5 7 7 6 

2 2 5 7 5 
9 8 3 1 3 

4 6 7 8 1 

7 9 6 9 8 

6 7 8 6 2 

5 3 9 4 6 

8 4 13 7 

3 12 2 9 
1 5 4 5 4 

7 116 5 

8 2 8 7 2 

3 4 7 3 1 

1 5 9 2 3 

9 9 4 8 8 

2 7 2 9 9 

5 8 6 1 7 

6 6 3 5 6 

4 3 5 4 4 


9 9 2 4 2 

2 6 8 3 7 
1 7 7 9 8 

5 2 5 2 3 

3 4 114 

7 3 3 7 5 

6 8 4 8 6 

8 16 5 9 

4 5 9 6 1 

7 2 8 4 8 

9 8 7 9 4 

5 12 6 3 

8 5 4 2 6 

1 6 9 7 5 

4 4 1 12 

3 9 6 8 1 

6 3 5 5 7 

2 7 3 3 9 

1 2 7 7 2 

3 6 3 9 6 

8 3 5 2 7 

5 9 2 1 3 

6 4 9 5 9 

7 5 8 3 4 

4 114 1 

9 7 4 6 8 

2 8 6 8 5 

7 6 3 5 6 

5 4 12 7 

4 3 7 1 8 

3 8 5 9 9 

8 2 8 6 5 

6 14 8 4 

9 9 6 3 2 

1 7 2 7 3 

2 5 9 4 1 

4 4 7 2 2 

2 9 14 7 

8 2 8 9 8 

5 7 2 1 3 

6 15 8 6 

7 3 3 5 5 

9 5 6 3 4 
1 8 4 6 1 

3 6 9 7 9 
4 2 6 5 9 

6 8 13 1 
3 6 4 7 8 

1 4 8 2 4 

7 9 9 6 6 
9 5 3 1 3 

2 15 8 5 

5 3 7 9 7 

8 7-2 4 2 

1 6 3 8 3 

2 2 8 9 8 

8 7 7 5 1 

3 5 6 1 4 

9 3 2 7 5 

4 4 9 4 6 

7 8 16 7 

6 14 3 9 

5 9 5 2 2 

4 9 6 6 1 

5 3 19 6 

8 6 8 5 7 
3 7 4 4 2 

9 8 3 3 9 

2 4 7 1 4 

1 5 9 2 8 

7 15 8 3 

6 2 2 7 5 

3 17 2 7 

2 2 5 8 8 

1 5 8 6 9 

6 3 2 7 6 
9 8 4 1 4 

5 7 6 3 3 

4 6 15 5 

8 4 9 9 1 

7 9 3 4 2 

9 2 4 9 5 
118 13 

3 9 5 5 7 

8 6 6 7 2 
7 5 3 3 9 

2 4 9 2 1 

4 3 7 8 4 

6 8 16 6 

5 7 2 4 8 


1 4 9 7 8 
9 6 4 4 2 

7 5 18 4 

3 7 5 5 6 

6 17 13 

4 3 3 2 7 

8 2 6 3 5 

2 8 8 9 9 

5 9 2 6 1 

7 2 6 7 9 

2 5 5 2 6 

9 7 9 5 3 

3 6 8 9 1 

4 17 3 7 
18 2 12 

5 4 3 6 8 

8 9 4 4 5 

6 3 18 4 

118 3 1 

6 8 2 5 9 

7 5 113 

2 2 9 4 5 

4 9 6 7 2 

3 3 3 8 6 

9 6 5 6 8 

5 4 7 9 4 

8 7 4 2 7 

5 4 3 6 3 

9 6 5 5 5 

2 2 12 4 
8 5 8 8 1 

7 3 7 3 8 

4 12 7 9 

1 7 4 4 6 

3 9 0 9 2 

6 S 9 1 7 

4 8 4 4 8 

8 6 5 9 9 

2 4 9 3 3 

6 9 15 6 

7 3 6 8 7 

9 7 2 2 4 

5 18 12 

3 5 7 7 1 
1 2 3 6 5 


7 4 6 4 3 

2 18 3 9 
9 2 2 7 S 

3 5 9 8 2 

5 7 5 6 1 
16 4 15 

4 3 15 4 

6 9 7 2 7 

8 8 3 9 6 

7 8 16 5 

3 4 8 5 1 

4 3 7 7 9 
15 4 14 

9 19 3 7 
2 2 6 8 2 

6 9 3 2 6 

5 6 2 9 3 

8 7 5 4 8 

3 7 5 4 9 

6 5 4 9 2 

7 3 9 2 7 

9 13 3 8 

8 6 7 8 3 

2 9 114 

1 4 6 6 1 

5 8 8 7 5 

4 2 2 5 6 

9 8 6 4 4 

3 14 8 8 

4 9 9 9 1 

7 5 7 2 2 

6 4 5 3 9 

5 2 2 7 7 
13 115 

8 6 8 5 3 

2 7 3 6 6 

1 9 4 8 5 

2 7 6 7 7 

8 19 2 3 

9 6 5 3 1 

6 8 2 5 4 

4 2 7 4 8 

7 3 3 9 9 

5 5 116 

3 4 8 6 2 


2 12 2 4 

5 9 6 5 9 

7 4 4 7 8 

6 3 7 3 7 

8 5 3 9 3 

4 2 8 8 1 

1 6 5 4 5 

3 8 112 

9 7 9 6 6 

114 4 8 

4 8 7 2 1 

5 5 2 9 6 

8 7 17 5 

2 4 9 3 4 

7 9 3 6 3 

3 6 5 8 7 

9 2 6 1 2 

6 3 8 5 9 

9 7 4 9 9 

2 8 5 6 3 

6 9 7 3 6 
12 117 

8 19 2 8 

3 6 3 7 1 

5 5 2 5 2 

7 4 8 8 5 

4 3 6 4 4 

8 6 6 9 6 

3 9 3 1 7 

1 3 4 6 8 

4 5 2 5 1 

5 7 7 2 9 

9 4 14 4 

6 8 9 8 3 

2 15 7 5 

7 2 8 3 2 

2 7 9 6 5 

6 8 6 9 8 

7 6 5 7 7 
117 5 1 

3 4 14 6 

8 3 4 3 2 

9 9 2 1 9 
5 2 3 8 3 

4 5 8 2 4 


3 3 6 8 1 
2 2 7 1 8 
1 9 9 2 4 

4 5 5 3 9 

5 4 4 9 5 
9 7 2 4 2 
8 6 3 6 3 

0 115 7 

7 8 8 7 6 

1 2 7 8 1 

6 5 12 2 

2 7 9 5 6 

8 8 8 6 7 

7 9 4 7 8 

4 6 6 1 5 

3 4 3 4 9 

9 15 9 4 

5 3 2 3 3 

9 4 8 8 3 

2 19 4 2 

8 6 5 1 1 

3 9 6 2 9 
1 7 3 5 6 

4 2 13 4 

7 5 4 6 7 

5 3 7 9 5 

6 8 2 7 8 

5 8 12 6 

7 3 7 5 7 

8 4 6 7 4 

1 2 5 6 5 

3 6 2 9 9 

2 13 3 1 

6 7 4 4 8 

4 5 9 1 2 

9 9 8 8 3 

9 8 7 3 4 

2 2 2 2 9 

6 7 8 6 7 

8 3 4 5 8 

5 9 5 9 2 
4 6 9 7 1 

3 1113 
1 5 6 8 6 

7 4 3 4 5 


4 7 16 4 

7 9 8 9 5 

6 3 7 4 9 

5 6 2 5 2 
3 8 9 7 8 

8 4 5 8 6 

9 13 3 7 
12 6 11 

2 5 4 2 3 

8 9 7 6 9 

9 5 14 2 

5 3 8 9 4 

7 2 9 7 8 

3 6 2 5 6 

2 4 4 2 5 

6 8 6 1 7 

4 15 8 1 

1 7 3 3 3 

3 2 5 1 3 

8 6 4 2 6 

9 5 9 9 8 

4 8 2 5 4 

5 3 3 3 1 

6 9 8 7 5 
1118 9 

2 7 7 6 7 

7 4 6 4 2 

5 4 111 

6 7 4 4 9 

2 8 3 9 2 

7 2 9 7 6 
4 6 5 2 7 
1 9 2 6 3 

3 3 8 5 5 

8 5 7 3 8 

9 16 8 4 

3 8 2 1 3 

1 4 8 6 2 

2 5 9 5 7 
9 3 4 2 8 

6 2 5 7 5 

7 7 6 9 6 

8 9 7 3 4 

4 6 3 8 9 

5 114 1 


9 9 3 2 3 

2 4 2 5 4 

6 15 6 6 

4 6 13 2 

7 3 7 7 1 

5 8 4 8 8 

8 2 6 9 5 

3 7 8 4 7 
15 9 19 

7 5 8 1 7 

3 9 4 3 8 

6 2 6 8 3 

8 4 7 7 4 

1 7 9 9 1 

2 6 2 5 6 

9 13 6 5 
5 8 12 9 

4 3 5 4 2 

9 5 6 8 8 
8 17 9 6 

7 3 8 1 1 

1 2 3 7 7 

2 9 9 6 9 

5 4 2 5 5 
4 7 4 3 2 

3 8 5 4 4 

6 0 12 3 

12 17 3 

3 7 3 3 4 

8 9 5 9 2 

4 4 2 4 7 

7 6 4 8 1 

2 3 8 5 6 

9 8 6 6 8 

5 17 19 

6 5 9 2 5 

3 5 3 2 6 
2 8 9 8 4 
14 118 

5 16 7 2 

6 9 7 3 7 

7 2 2 6 1 

4 3 5 4 9 

8 6 4 9 5 

9 7 8 5 3 
TABLE 15.6 Permutations of 9 (Continued) 


2 

3 

1 

7 

1 

4 

2 4 

3 1 

9 2 

1 

1 

] 

4 

6 

9 3 5 

3 6 

2 

5 5 

1 

1 

4 1 

2 

7 

l 4 

5 

3 

5 

9 

6 

3 

3 

3 

4 

6 

8 6 

4 1 

8 

6 

6 

8 

1 

2 7 9 

1 5 

1 

2 8 

6 

6 

3 4 

5 

5 

6 1 

4 

0 

8 

4 

5 

5 

8 

6 

1 

9 

1 5 

5 6 

9 

4 

7 

6 

3 

5 4 7 

5 1 

7 

7 2 

3 

8 

3 5 

4 

3 

2 2 

8 

7 

1 

6 

8 

1 

7 

2 

9 

2 

5 3 

8 9 4 7 

9 

7 

4 

7 2 2 

9 4 

9 

9 3 

9 

5 

9 2 

1 

9 

8 6 

7 

4 

4 

1 

2 

4 

9 

5 

5 

3 

6 9 

7 8 

6 

3 

8 

1 

8 

4 9 6 

6 2 

8 

1 7 

4 

7 

8 3 

7 

8 

3 3 

9 

cS 

6 

2 

7 

8 

6 

8 3 

7 4 7 

2 5 

7 

5 

3 

3 

2 

6 8 8 

2 8 

5 

3 9 

2 

2 

2 0 

6 

1 

7 8 

2 

2 

3 

5 

4 

9 

2 

1 

7 

8 7 4 

3 4 

2 

8 

5 

5 

9 

8 6 4 

8 3 

3 

8 4 

5 

4 

6 8 

3 

4 

9 5 

C> 

9 

9 

8 

3 

6 

5 

7 

6 

5 

2 2 

1 7 

5 

9 

2 

2 

7 

1 5 3 

7 9 

4 

6 6 

8 3 

7 7 8 

6 

4 9 

3 

3 

7 

7 

9 

2 

4 

9 

8 

3 

9 8 

6 3 

3 

2 

4 

9 

5 

3 3 1 

4 7 

6 

4 3 

7 

9 

5 6 

9 

2 

6 7 

1 

0 

9 

9 

6 

S 

1 

9 

5 

1 

6 2 

8 9 

5 

8 

9 

7 

7 

4 9 9 

4 9 

5 

4 6 

5 

9 

5 5 

2 

6 

8 9 

7 

3 

1 

7 

9 

5 

6 

1 

8 

7 

5 1 

1 1 

7 

7 

8 

2 

4 

7 4 7 

9 5 

1 

2 2 

1 

6 

1 9 

8 

8 

(5 6 

1 

S 

7 3 

2 

7 

9 

4 

1 

2 

4 3 

3 0 

1 

3 

3 

3 

3 

8 6 1 

5 6 

3 

5 5 

4 

4 

4 2 

9 

5 

3 1 

3 

7 

8 

5 

7 

4 

4 

5 

9 

4 

1 4 

7 2 

3 

4 

6 

5 

9 

2 2 6 

3 8 

8 

1 3 

7 

8 

7 1 

4 

4 

1 8 

9 

9 

4 

8 8 

6 

8 

3 

2 

3 

9 7 

6 7 

2 

5 

1 

6 

6 

3 8 5 

2 1 

4 

3 3 

3 

7 

0 7 7 

3 

9 7 

6 

2 

6 

1 

4 

9 

7 

8 3 

9 

8 6 

5 8 

6 

1 

2 

8 

2 

5 1 2 

1 4 

2 

9 8 

9 

2 

2 8 

3 

2 

7 2 

A 

r> 

2 

6 

1 

3 

2 

6 

7 

5 

3 5 

2 5 

4 

6 

7 

1 

1 

6 5 3 

6 7 

7 8 9 

8 

3 

3 6 

5 

3 

5 3 

8 

3 

3 

2 

5 

1 

3 

7 

6 

6 

2 8 

9 3 

8 

2 

5 

9 

8 

1 3 8 

8 3 

0 7 4 

6 

1 

9 3 

6 

7 

4 4 

2 

6 

5 

4 

3 

2 

5 

2 

4 

8 

7 9 

4 4 

9 

9 

4 

4 

5 

9 7 4 

7 2 9 

6 7 
2 

11 

6 

1 

1 

15 

3 

1 

7 

12 

1 

9 

7 

6 

15 

7 

14 

5 

15 

13 

4 

15 

12 

5 

3 

S 

2 

9 

8 

14 

12 

16 

4 

4 

<5 

5 

5 

11 

16 

8 

0 

10 

15 

3 

2 

9 

1 

13 

n 

7 

11 

6 

10 

14 

10 

10 

4 

8 

9 

9 

3 

8 

2 

7 

0 

11 

5 

15 

15 

4 

12 

3 

9 

14 

5 

C 

11 

10 

5 

3 

15 

4 

6 

11 

S 

1 

12 

4 

2 

10 

7 

14 

13 

8 

9 

9 

2 

8 

12 

13 

5 

15 

10 

4 

15 

2 

7 

1 

1 

12 

10 

10 

12 

13 

7 

4 

13 

7 

3 

14 

3 

9 

5 

9 

3 

13 

10 

3 

10 

IB 

10 

1 

14 

16 

2 

8 

6 

11 

6 

11 

14 

15 

2 

8 

8 

1 

10 

9 

11 

13 

13 

3 

5 

13 

10 

11 

8 

4 

2 

4 

3 

9 

14 

12 

5 

8 

2 

7 

6 

11 

6 

7 

11 

15 

0 

9 

10 

12 

16 

10 

7 

4 

10 

5 

35 

15 

12 

8 

10 

2 

2 

11 

13 

3 

1 

10 

14 

6 

14 

12 

4 

13 

35 

9 

O 

5 

5 

1 

1 

3 

12 

1 

8 

4 

14 

14 

6 

16 

7 

7 

10 

3 

2 

6 

0 

5 

16 

7 

8 

1 

14 

6 

9 

9 

4 

15 

8 

0 

13 

10 

10 

2 

10 

2 

7 

3 

1 

3 

12 

15 

12 

10 

15 

10 

12 

2 

12 

8 

1 

3 

9 

7 

5 

5 

11 

7 

15 

13 

11 

2 

11 

5 

10 

4 

10 

16 

9 

11 

7 

8 

4 

13 

14 

16 

14 

1 

11 

4 

14 

5 

13 

4 

1 

15 

13 

6 

14 

12 

3 

0 

8 



TABLE 15.7 Permutations of 16 (Continued) 


l 

10 

14 

10 

12 

10 

16 

15 

3 

4 

11 

8 

7 

14 

14 

14 

3 

13 

9 

11 

4 

12 

16 

15 

5 

2 

2 

Q 

5 

9 

7 

16 

5 

7 

10 

12 

C 

10 

6 

6 

15 

4 

8 

13 

13 

10 

14 

2 

11 

2 

3 

1 

3 

1 

8 

9 

13 

4 

12 

3 

13 

7 

6 

4 

15 

5 

11 

1 

2 

7 

6 

9 

11 

10 

1 

8 

5 

12 

8 

10 

5 

1 

3 

C> 

16 

7 

11 

10 

4 

11 

15 

8 

16 

10 

12 

9 

13 

1 

13 

7 

16 

5 

6 

15 

5 

4 

2 

11 

36 

0 

3 

6 

14 

3 

8 

0 

9 

2 

1 

15 

14 

7 

8 

2 

9 

12 

10 

5 

11 

1 

2 

14 

13 

12 

2 

8 

4 

7 

7 

4 

13 

15 

12 

5 

3 

11 

12 

4 

9 

10 

1 

10 

15 

14 

13 

10 

3 

9 

8 

14 

8 

8 

15 

3 

2 

10 

1 

13 

36 

13 

11 

6 

5 

1 

8 

1 

13 

10 

14 

3 

7 

11 

2 

G 

5 

13 

5 

f! 

4 

12 

5 

12 

1 

10 

10 

9 

14 

3 

13 

1 

15 

4 

14 

9 

9 

16 

9 

16 

12 

7 

4 

2 

8 

8 

0 

(5 

15 

U 

7 

4 

14 

10 

12 

11 

11 

3 

10 

9 

5 

15 

12 

3 

4 

15 

10 

2 

7 

7 

2 

14 

IS 

1 

1 

9 

7 

10 

14 

3 

13 

4 

4 

15 

14 

11 

12 

15 

12 

10 

4 

14 

2 

9 

1G 

16 

16 

7 

10 

8 

S 

15 

9 

8 

15 

3 

13 

6 

5 

7 

10 

1 

16 

6 

11 

7 

9 

5 

4 

12 

15 

5 

11 

2 

4 

6 

3 

14 

7 

13 

1 

11 

12 

3 

2 

12 

6 

3 

16 

9 

2 

10 

1 

11 

6 

5 

2 

8 

13 

5 

14 

8 


14 

3 

10 

10 

15 

11 

10 

0 

3 

6 

16 

2 

3 

6 

3 

8 

13 

16 

8 

9 

2 

12 

7 

10 

5 

15 

11 

O 

4 

11 

13 

6 

8 

15 

7 

1 

1 

15 

1 

S 

10 

5 

9 

9 

10 

9 

14 

13 

13 

4 

4 

7 

11 

5 

1 

5 

S 

1 

14 

14 

12 

9 

5 

11 

10 

G 

4 

4 

12 

2 

7 

10 

14 

7 

13 

3 

15 

12 

2 

12 

14 

16 

1 

12 

5 

13 

8 

8 

6 

14 

6 

15 

0 

3 

10 

9 

9 

10 

2 

3 

4 

10 

11 

15 

13 

3 

14 

15 

9 

1 

12 

12 

5 

1 

6 

15 

7 

7 

13 

8 

8 

11 

2 

14 

16 

7 

3 

3 

11 

15 

2 

4 

13 

5 

9 

1 

13 

4 

4 

4 

11 

5 

9 

16 

7 

5 

1 

12 

7 

2 

10 

6 

1G 

10 

12 

16 

2 

14 

8 

11 

15 

10 

15 

7 

1 

16 

11 

G 

4 

11 

8 

6 

10 

10 

2 

11 

13 

1 

6 

3 

3 

12 

2 

14 

9 

4 

9 

7 

16 

4 

12 

10 

3 

1 

10 

1 

15 

13 

n 

14 

5 

5 

5 

12 

0 

14 

1 

14 

15 

13 

7 

7 

10 

5 

12 

10 

4 

4 

3 

10 

6 

2 

8 

9 

7 

9 

8 

9 

2 

8 

13 

3 

11 

8 

15 

2 

14 

12 

13 

5 

6 

1 

2 

13 

2 

5 

13 

9 

10 

8 

10 

11 

8 

15 

12 

11 

4 

12 

7 

3 

2 

6 

4 

9 

10 

14 

15 

10 

12 

13 

7 

16 

U 

G 

4 

3 

3 

16 

14 

7 

8 

14 

6 

2 

11 

1 

2 

1 

8 

5 

9 

10 

14 

11 

9 

4 

7 

3 

16 

1 

16 

5 

13 

3 

15 

12 

9 

15 

1 

6 

13 

12 

7 

4 

14 

15 

8 

5 

5 

16 


15 

5 

2 

6 

9 

1 

15 

10 

2 

4 

4 

14 

7 

14 

3 

6 

13 

6 

15 

14 

14 

12 

5 

11 

7 

9 

S 

12 

1 

16 

12 

7 

16 

13 

8 

2 

9 

3 

12 

15 

11 

11 

11 

3 

12 

10 

1 

4 

7 

13 

S 

2 

8 

5 

1 

13 

6 

9 

10 

10 

3 

16 

1 

9 

11 

7 

10 

13 

8 

5 

10 

4 

15 

4 

6 

5 

3 

14 

16 

2 

8 

8 

4 

5 

13 

3 

4 

16 

14 

14 

13 

14 

12 

7 

6 

16 

2 

1 

10 

8 

14 

10 

10 

11 

5 

11 

5 

8 

16 

7 

6 

13 

5 

9 

4 

2 

1 

9 

13 

15 

7 

7 

14 

8 

3 

1 

9 

7 

2 

10 

10 

12 

15 

3 

1 

12 

11 

6 

G 

9 

4 

15 

2 

1 

11 

9 

6 

3 

4 

16 

15 

10 

13 

12 

2 

5 

3 

11 

15 

12 

8 

11 

1 

5 

9 

4 

1 

14 

6 

15 

5 

3 

2 

13 

8 

G 

2 

12 

2 

7 

16 

7 

10 

3 

10 

9 

14 

3 

1 

4 

11 

10 

10 

15 

13 

14 

12 

13 

9 

12 

15 

8 

4 

4 

14 

2 

15 

5 

10 

3 

1 

4 

8 

11 

16 

10 

13 

11 

16 

1 

7 

5 

7 

7 

2 

12 

6 

9 

8 

5 

13 

16 

15 

14 

11 

3 

9 

6 

12 

G 

4 

9 

1 

4 

13 

6 

10 

14 

1G 

11 

9 

11 

9 

G 

14 

7 

8 

2 

3 

7 

3 

2 

15 

10 

2 

5 

5 

6 

8 

8 

10 

a 

4 

12 

3 

16 

12 

16 

7 

5 

13 

14 

5 

13 

12 

15 

1 

8 

5 

10 

2 

7 

12 

14 

6 

11 

15 

11 

11 

1 

8 

16 

' 3 

1 

9 

12 

4 

7 

9 

4 

14 

13 

10 

2 

16 

1 

3 

13 

15 

15 


8 

4 

12 

G 

2 

3 

12 

11 

12 

7 

10 

16 

7 

4 

9 

5 

15 

14 

3 

14 

2 

7 

5 

2 

4 

16 

6 

1 

9 

15 

12 

9 

15 

7 

10 

11 

14 

9 

30 

3 

1 

11 

13 

5 

6 

4 

3 

4 

1 

8 

13 

10 

6 

13 

1 

7 

5 

10 

S 

13 

14 

8 

2 

14 

12 

6 

1 

S 

10 

10 

15 

13 

16 

11 

5 

9 

2 

3 

15 

11 

3 

8 

6 

10 

1 

7 

G 

10 

9 

16 

14 

15 

3 

14 

6 

10 

3 

7 

2 

14 

5 

16 

13 

6 

15 

G 

1 

9 

4 

5 

11 

9 

4 

5 

4 

4 

10 

16 

10 

10 

13 

12 

15 

11 

13 

9 

14 

8 

12 

0 

1 

2 

5 

3 

11 

S 

13 

1 

8 

7 

16 

5 

12 

13 

o 

2 

7 

2 

7 

12 

15 

4 

11 

1 

8 

12 

11 

14 

15 

3 

1 

9 

8 

11 

13 

9 

6 

5 

16 

3 

14 

12 

7 

14 

5 

3 

8 

10 

10 

14 

12 

11 

4 

12 

12 

4 

1 

12 

4 

15 

10 

2 

3 

5 

11 

16 

3 

2 

9 

10 

7 

14 

15 

7 

2 

8 

5 

13 

13 

10 

13 

15 

9 

1 

8 

5 

13 

16 

8 

6 

15 

7 

1 

3 

7 

2 

10 

14 

15 

3 

6 

4 

6 

6 

9 

11 

16 

11 

2 

4 

2 

2 

3 

10 

11 

13 

S 

11 

6 

9 

14 

10 

14 

11 

3 

4 

14 

8 

1 

35 

3 

11 

13 

10 

1 

9 

9 

1 

9 

2 

1 

15 

5 

13 

13 

6 

5 

12 

15 

5 

11 

12 

9 

12 

4 

8 

1 

7 

7 

7 

10 

10 

16 

8 

10 

7 

7 

15 

14 

14 

5 

4 

2 

3 

6 

12 

13 

6 

2 

8 

15 

6 

10 

5 

16 

16 

3 

4 

4 

12 



TABLE 15.7 Permutations of 16 (Continued) 


9 

2 

4 

7 

11 

8 

9 

12 

11 

3 

13 

1 

7 

8 

2 

3 

5 

5 

12 

7 

8 

5 

16 

15 

16 

15 

16 

13 

15 

11 

5 

6 

15 

3 

6 

2 

10 

11 

13 

13 

12 

8 

9 

2 

3 

9 

6 

3 

3 

1 

1 

12 

14 

14 

1 

14 

4 

7 

5 

9 

14 

7 

5 

11 

10 

16 

1 

6 

4 

6 

10 

13 

6 

13 

15 

1 

15 

14 

8 

15 

0 

3 

11 

4 

8 

12 

8 

10 

2 

4 

7 

4 

10 

10 

7 

7 

7 

15 

14 

16 

3 

10 

1 

10 

12 

10 

12 

8 

7 

12 

4 

11 

2 

12 

5 

4 

13 

4 

1 

5 

10 

10 

3 

6 

9 

11 

14 

1 

9 

2 

11 

14 

S 

9 

13 

13 

3 

2 

6 

8 

15 

9 

13 

1 

14 

0 

2 

9 

10 

14 

2 

15 

12 

5 

4 

5 

11 

16 

10 

10 

13 

15 

5 

7 

2 

14 

16 

4 

9 

6 

o 

9 

9 

8 

15 

11 

10 

9 

2 

3 

11 

1 

12 

11 

13 

16 

14 

3 

7 

7 

12 

3 

4 

5 

14 

10 

11 

15 

15 

14 

6 

5 

13 

1 

3 

7 

12 

32 

16 

8 

7 

13 

10 

2 

8 

5 

15 

14 

3 

9 

15 

12 

6 

G 

9 

15 

3 

5 

13 

1 

5 

S 

1 

16 

10 

2 

7 

11 

10 

4 

9 

10 

S 

15 

0 

13 

9 

8 

11 

15 

1 

2 

1(5 

4 

5 

3 

2 

16 

4 

2 

3 

6 

3 

14 

11 

8 

5 

13 

5 

5 

8 

4 

2 

12 

12 

4 

1 

7 

8 

16 

16 

16 

11 

10 

10 

12 

6 

2 

6 

12 

4 

11 

7 

13 

1 

9 

13 

6 

12 

13 

10 

14 

15 

3 

7 

1 

4 

1 

14 

10 

14 

7 

14 

9 

4 

C 

8 

10 

1 

11 

10 

4 

6 

16 

9 

2 

5 

11 

10 

5 

6 

12 

12 

2 

15 

16 

3 

14 

9 

11 

13 

5 

14 

7 

12 

3 

1 

4 

6 

1 

8 

2 

16 

6 

13 

6 

12 

7 

2 

12 

5 

9 

9 

12 

16 

1 

6 

36 

13 

3 

10 

15 

10 

11 

1 

5 

4 

8 

1 

15 

14 

8 

8 

15 

8 

9 

9 

G 

8 

16 

1 

10 

11 

13 

2 

4 

7 

2 

7 

13 

3 

13 

3 

3 

o 

S 

10 

15 

14 

7 

4 

10 

5 

10 

7 

13 

8 

1 

3 

6 

9 

1 

1 

5 

14 

14 

15 

3 

15 

10 

7 

0 

4 

14 

11 

11 

11 

13 

5 

8 

11 

11 

7 

9 

3 

7 

10 

10 

32 

4 

15 

14 

13 

8 

4 

10 

2 

5 

16 

2 

12 

3 

2 

1 

10 

12 

13 

9 

13 

9 

2 

7 

15 

4 

6 

15 

14 

12 

4 

14 

15 

9 

15 

13 

16 

4 

10 

5 

15 

4 

7 

4 

11 

4 

2 

3 

11 

1 

14 

11 

12 

10 

1 

11 

14 

1 

7 

11 

5 

10 

10 

14 

9 

8 

1 

11 

8 

2 

16 

15 

10 

3 

10 

9 

13 

14 

1 

9 

12 

3 

2 

15 

8 

6 

10 

15 

5 

3 

7 

12 

8 

13 

4 

1 

3 

10 

12 

16 

10 

1 

14 

12 

14 

10 

7 

8 

4 

10 

1 

14 

11 

11 

13 

IS 

6 

16 

14 

13 

13 

7 

5 

6 

3 

7 

9 

12 

9 

12 

9 

13 

13 

7 

2 

2 

12 

13 

6 

6 

2 

6 

9 

8 

7 

5 

15 

6 

16 

7 

4 

9 

1 

2 

12 

3 

5 

7 

2 

15 

11 

16 

6 

1 

6 

16 

11 

5 

3 

14 

6 

8 

3 

5 

16 

14 

8 

9 

13 

4 

8 

5 

4 

16 

5 

12 

4 

2 

15 

8 

3 

2 


10 

8 

11 

14 

16 

2 

13 

2 

2 

15 

9 

7 

4 

2 

5 

11 

1 

10 

10 

3 

5 

12 

14 

7 

15 

1 

12 

16 

16 

12 

3 

6 

9 

16 

4 

13 

6 

15 

3 

5 

4 

9 

3 

4 

13 

7 

10 

4 

7 

n 

16 

14 

1 

15 

7 

16 

fi 

5 

1 

7 

2 

10 

2 

8 

1 

0 

3 

13 

12 

8 

13 

3 

6 

5 

3 

8 

14 

3 

13 

1 

6 

15 

10 

11 

11 

4 

7 

1 

14 

2 

7 

1 

13 

9 

6 

15 

16 

12 

0 

4 

11 

11 

7 

12 

14 

34 

11 

14 

9 

16 

8 

5 

12 

0 

9 

9 

0 

11 

4 

13 

1 

4 

8 

3 

2 

3 

8 

8 

8 

14 

14 

16 

15 

13 

12 

12 

2 

9 

31 

31 

15 

13 

10 

10 

10 

5 

4 

7 

15 

10 

12 

2 

5 

1 

8 

10 

15 

6 

5 

9 

5 

10 

7 

35 

6 

6 

o 

9 

15 

9 

33 

11 

10 

JO 

7 

2 

9 

2 

32 

6 

8 

1 

14 

7 

33 

5 

8 

10 

9 

10 

2 

2 

2 

9 

2 

16 

6 

8 

ft 

1 

10 

S 

6 

12 

4 

15 

7 

11 

3 

ft 

14 

15 

9 

3 

15 

9 

11 

16 

0 

12 

32 

14 

4 

6 

16 

14 

10 

12 

1 

4 

9 

4 

3 

34 

9 

12 

4 

15 

2 

2 

1 

C 

5 

4 

12 

1 

12 

14 

14 

3 

4 

3 

11 

5 

34 

13 

1 

5 

10 

15 

15 

16 

13 

1 

8 

8 

5 

4 

13 

10 

13 

9 

8 

2 

13 

11 

16 

7 

4 

13 

3 

5 

1 

10 

1 

4 

14 

1 

11 

14 

6 

13 

15 

13 

3 

7 

15 

3 

7 

8 

7 

7 

16 

8 

5 

3 

3 

13 

8 

7 

16 

12 

12 

11 

10 

10 

13 

6 

10 

11 

1 

14 

2 

14 

8 

8 

13 

6 

5 

13 

8 

5 

8 

9 

9 

14 

8 

5 

7 

1 

13 

6 

9 

5 

1 

9 

1 

11 

H 

6 

16 

9 

14 

1 

13 

5 

16 

13 

1 

3 

11 

16 

3 

10 

6 

13 

15 

4 

12 

14 

12 

15 

11 

4 

7 

7 

10 

7 

3 

7 

15 

13 

10 

8 

11 

11 

3 

3 

10 

11 

6 

10 

1 

7 

2 

16 

6 

15 

6 

4 
TABLE 15.7 Permutations of 16 (Continued) 